[Corpora-List] chinese pos tagger/lemmatizer
baroni at sslmit.unibo.it
Thu Jan 19 14:47:01 CET 2006
Does anybody know of a tokenizer/POS tagger for the Chinese language,
ideally with these characteristics:
- documented in English
- free or cheap
- runs on the Unix command line, more or less out-of-the-box
Moreover, we are also looking for a tool/electronic resource that, given a
tokenized word, would provide a pinyin transcription of the word. Does such
a tool exist?
Thanks in advance for the advice.
SSLMIT, University of Bologna
More information about the Corpora-archive