[Corpora-List] Lemmatizing German text for lexical purposes

René Witte witte at semanticsoftware.info
Tue Jan 17 00:56:14 CET 2012


On Mon January 16 2012, Ciarán Ó Duibhín wrote:
> Are there any lemmatized corpora of German which can be queried
> on-line or on Windows? I'm trying to lemmatize some German text myself for
> lexical purposes, and I would like to see how others have handled the
> problems, and how well it works.

We have an open source lemmatizer for German nouns, based on GATE:

http://www.semanticsoftware.info/durm-german-lemmatizer

You could use it to prepare your own corpus and install GATE Mimir (http://gate.ac.uk/family/mimir.html) to have a nice query interface.

Best, René


