[Corpora-List] Application for lemmatising corpora

Oliver Strunk strunk at ub.edu
Fri Mar 23 00:33:00 CET 2007


Maybe the TreeTagger from IMS Stuttgart?



http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagg
er.html



It is available for linux and windows; the output includes POS and
lemmatized text and can easily be converted.



Oliver Strunk

LADA - University of Barcelona



From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no] On
Behalf Of Hunter, Duncan
Sent: Thursday, March 22, 2007 11:45 PM
To: corpora at uib.no
Subject: [Corpora-List] Application for lemmatising corpora



Hi All,



Could anybody suggest a small, downloadable and free application for
lemmatising texts? For various reasons I need the texts I am examining to be
in lemmatised form before analysis with corpus tools. It's a small
collection of texts, a few hundred shortish (article -sized) ones in text
format.



I've had some trouble with the software I'm using at the moment. It tends to
'stick' when given a formidable lemma list to process (I'm using Yasumasa
Someya's fairly lengthy one).



All the best,



Duncan Hunter

-------------- next part --------------
An HTML attachment was scrubbed...
URL: https://mailman.uib.no/public/corpora-archive/attachments/20070323/c1984090/attachment.html


More information about the Corpora-archive mailing list