[Corpora-List] Application for lemmatising corpora

jasper holmes jasper.holmes at gmail.com
Fri Mar 23 11:00:00 CET 2007


You could try WMatrix: http://www.comp.lancs.ac.uk/ucrel/wmatrix/
You need to get a username (one month free trial), and then you do it
online. This does tagging and lemmatising and also some analysis
(frequencies, concordances, key words).

Jasper
http://go.warwick.ac.uk/BAWE


On 3/22/07, Oliver Strunk <strunk at ub.edu> wrote:
>
> Maybe the TreeTagger from IMS Stuttgart?
>
> http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagger.html
>
> It is available for linux and windows; the output includes POS and
> lemmatized text and can easily be converted.
>
> Oliver Strunk
> LADA – University of Barcelona
>
> From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no] On
> Behalf Of Hunter, Duncan
> Sent: Thursday, March 22, 2007 11:45 PM
> To: corpora at uib.no
> Subject: [Corpora-List] Application for lemmatising corpora
>
> Hi All,
>
> Could anybody suggest a small, downloadable and free application for
> lemmatising texts? For various reasons I need the texts I am examining to be
> in lemmatised form before analysis with corpus tools. It's a small
> collection of texts, a few hundred shortish (article-sized) ones in text
> format.
>
> I've had some trouble with the software I'm using at the moment. It tends to
> 'stick' when given a formidable lemma list to process (I'm using Yasumasa
> Someya's fairly lengthy one).
>
> All the best,
>
> Duncan Hunter
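
Regarding the TreeTagger suggestion quoted above, the conversion step might look something like the sketch below. It assumes the tagger's usual one-token-per-line output with three tab-separated columns (token, POS tag, lemma), and "<unknown>" in the lemma column for words it cannot lemmatise. The function name lemmatise_tagged and the sample text are made up for illustration, so check the assumptions against your own output before relying on it.

```python
def lemmatise_tagged(tagged: str, unknown: str = "<unknown>") -> str:
    """Turn 'token<TAB>tag<TAB>lemma' lines into space-separated lemmas.

    Assumption: input is three-column, tab-separated tagger output.
    Lines that do not have exactly three columns are skipped.
    """
    lemmas = []
    for line in tagged.splitlines():
        parts = line.split("\t")
        if len(parts) != 3:
            continue  # blank line or markup passed through by the tagger
        token, _tag, lemma = parts
        # Fall back to the surface form when no lemma was found.
        lemmas.append(token if lemma == unknown else lemma)
    return " ".join(lemmas)


# Hypothetical sample in the assumed format ("blorping" is a made-up word).
sample = (
    "The\tDT\tthe\n"
    "cats\tNNS\tcat\n"
    "were\tVBD\tbe\n"
    "blorping\tVVG\t<unknown>\n"
)
print(lemmatise_tagged(sample))  # prints: the cat be blorping
```

Run over each tagged file in turn, something like this should give plain lemmatised text that other corpus tools can read.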





