[Corpora-List] R or Python: German lemmatizer / tokenizer / PoS-Tagger?

Amy Isard amyi at inf.ed.ac.uk
Wed Mar 23 16:15:13 CET 2016


I haven't used the German module of CLiPS Pattern (http://www.clips.ua.ac.be/pages/pattern-de), but I have used the English one, and it is easy to understand and well documented. It's written in Python.


On 23/03/16 14:50, Hanjo Hamann wrote:
> Dear all,
> a colleague of mine routinely uses R packages to annotate English texts,
> but has struggled to find any package for German-language texts. Do
> any of you use tried-and-tested packages for either R or Python that
> provide various annotation features (lemmata, tokens, PoS) for German
> (web forum chat) texts?
> Best regards,
> Hanjo

