[Corpora-List] R or Python: German lemmatizer / tokenizer / PoS-Tagger?

Vladimír Benko vladob at juls.savba.sk
Fri Mar 25 16:53:19 CET 2016

Dear Hanjo,

> a colleague of mine routinely uses R packages to annotate English texts,
> but has struggled to find any package for texts in German language. Do
> any of you use tried-and-proven packages for either R or Python that
> provide various annotation features (lemmata, tokens, PoS) for German
> (web forum chat) texts?

Python (utf-8) for several languages, and also several other corpus-related tools can be found here:



Vlado B, 16:50

-- Vladimír Benko

Slovak Academy of Sciences Ľ. Štúr Institute of Linguistics Panská 26, SK-81101 Bratislava

Tel +421-2-54431762 Fax -54431756

More information about the Corpora mailing list