[Corpora-List] Chinese Tokenization

Alla Shashkina alkasiko at yahoo.de
Wed Aug 15 13:45:15 CEST 2012


Hi Jiajin and all,

is there any good tokenizer also for traditional Chinese?

Thanks, Alla

On Aug 14, 2012, at 2:37 PM, Xu Jiajin <ustcxujj at gmail.com> wrote:


> Hi Ajay,
>
> You can get a copy of ICTCLAS tokeniser developed by Dr. Kevin Zhang at http://www.ictclas.org/ictclas_download.aspx.
>
> ICTCLAS is one of the best Chinese tokenisers.
>
> Jiajin XU
> Ph.D., associate professor
> National Research Centre for Foreign Language Education
> Beijing Foreign Studies University
>
> On Tue, Aug 14, 2012 at 8:17 PM, Ajay <ajay0221 at gmail.com> wrote:
> Dear Corpora list members,
>
> I am looking for Chinese Tokenization and Chinese Lemmatizer tool to tokenize Chinese Wikipedia text.
> Please suggest a open-source, and freely available tool.
>
> Regards,
> Ajay Dubey
> M.S. by Research
> SIEL, IIIT, Hyderabad
>
>
>
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 3029 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20120815/6c3bb223/attachment.txt>



More information about the Corpora mailing list