[Corpora-List] Application for lemmatising corpora
strunk at ub.edu
Fri Mar 23 00:33:00 CET 2007
Maybe the TreeTagger from IMS Stuttgart?
It is available for linux and windows; the output includes POS and
lemmatized text and can easily be converted.
LADA - University of Barcelona
From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no] On
Behalf Of Hunter, Duncan
Sent: Thursday, March 22, 2007 11:45 PM
To: corpora at uib.no
Subject: [Corpora-List] Application for lemmatising corpora
Could anybody suggest a small, downloadable and free application for
lemmatising texts? For various reasons I need the texts I am examining to be
in lemmatised form before analysis with corpus tools. It's a small
collection of texts, a few hundred shortish (article -sized) ones in text
I've had some trouble with the software I'm using at the moment. It tends to
'stick' when given a formidable lemma list to process (I'm using Yasumasa
Someya's fairly lengthy one).
All the best,
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the Corpora-archive