[Corpora-List] chinese pos tagger/lemmatizer

Marco Baroni baroni at sslmit.unibo.it
Thu Jan 19 14:47:01 CET 2006


Dear all,

Does anybody know of a tokenizer/POS tagger for the Chinese language,
ideally with these characteristics:

- documented in English
- free or cheap
- runs on the Unix command line, more or less out-of-the-box

Moreover, we are also looking for a tool/electronic resource that, given a
tokenized word, would provide a pinyin transcription of the word. Does such
a tool exist?

Thanks in advance for the advice.

Regards,

Marco


--
Marco Baroni
SSLMIT, University of Bologna
http://sslmit.unibo.it/~baroni





More information about the Corpora-archive mailing list