[Corpora-List] Application for lemmatising corpora

Oliver Strunk strunk at ub.edu
Fri Mar 23 00:33:00 CET 2007

Maybe the TreeTagger from IMS Stuttgart?


It is available for linux and windows; the output includes POS and
lemmatized text and can easily be converted.

Oliver Strunk

LADA - University of Barcelona

From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no] On
Behalf Of Hunter, Duncan
Sent: Thursday, March 22, 2007 11:45 PM
To: corpora at uib.no
Subject: [Corpora-List] Application for lemmatising corpora

Hi All,

Could anybody suggest a small, downloadable and free application for
lemmatising texts? For various reasons I need the texts I am examining to be
in lemmatised form before analysis with corpus tools. It's a small
collection of texts, a few hundred shortish (article -sized) ones in text

I've had some trouble with the software I'm using at the moment. It tends to
'stick' when given a formidable lemma list to process (I'm using Yasumasa
Someya's fairly lengthy one).

All the best,

Duncan Hunter

-------------- next part --------------
An HTML attachment was scrubbed...
URL: https://mailman.uib.no/public/corpora-archive/attachments/20070323/c1984090/attachment.html

More information about the Corpora-archive mailing list