[Corpora-List] Open-source corpus query tools

David Reitter david.reitter at gmail.com
Tue Dec 28 19:28:01 CET 2004

Grzegorz -

you may want to check out the NITE XML Toolkit (NXT)


This is a much more recent package, based on XML technologies. There is
a nice query language, the representation format is well worked out
(supports time-alignment), there is a useful library with an API for
Java (read / run queries etc.) and they have components that allow you
to throw together GUI-based annotation tools. Documentation is
available, too (even though it takes a few clicks to get through to

The above link is at Edinburgh, but the Stuttgart people have
co-developed NXT, as have others.


David Reitter - ICCS/HCRC, Informatics, University of Edinburgh
Blog: http://www.davids-world.com Homepage:

On 28 Dec 2004, at 14:50, Grzegorz Chrupała wrote:

> Hi all,
>
> I am looking for an open-source library or program that implements
> running queries on annotated corpora. I'd mostly be working with
> multilingual parallelized POS-tagged corpora. I found software (Corpus
> Query Processor from IMS,
> http://www.ims.uni-stuttgart.de/projekte/CorpusWorkbench/) that seems
> like what I need, but it is not open-source.
>
> Do you know of an OSS project that does something like that?
>
> Cheers,
> --
> Grzegorz

