[Corpora-List] R or Python: German lemmatizer / tokenizer / PoS-Tagger?

Hanjo Hamann hamann at coll.mpg.de
Wed Mar 23 15:50:25 CET 2016


Dear all,

A colleague of mine routinely uses R packages to annotate English texts but has struggled to find a comparable package for German-language texts. Do any of you use tried-and-tested packages for either R or Python that provide annotation features (lemmatization, tokenization, PoS tagging) for German texts (web forum chat)?

Best regards, Hanjo

--
Dr. Dr. Hanjo Hamann

Max-Planck-Institut zur Erforschung von Gemeinschaftsgütern
Kurt-Schumacher-Str. 10
D-53113 Bonn

Tel +49 228 91416 26
Fax +49 228 91416 55

hamann at coll.mpg.de
www.coll.mpg.de


