[Corpora-List] Lemmatizing German text for lexical purposes
René Witte
witte at semanticsoftware.info
Tue Jan 17 00:56:14 CET 2012
On Mon January 16 2012, Ciarán Ó Duibhín wrote:
> Are there any lemmatized corpora of German which can be queried
> on-line or on Windows? I'm trying to lemmatize some German text myself for
> lexical purposes, and I would like to see how others have handled the
> problems, and how well it works.
We have an open source lemmatizer for German nouns, based on GATE:
http://www.semanticsoftware.info/durm-german-lemmatizer
You could use it to prepare your own corpus, then install GATE Mimir
(http://gate.ac.uk/family/mimir.html) to get a nice query interface.
Best, René