[Corpora-List] Open-source corpus query tools
david.reitter at gmail.com
Tue Dec 28 19:28:01 CET 2004
You may want to check out the NITE XML Toolkit (NXT).
This is a much more recent package, based on XML technologies. It has
a nice query language and a well worked-out representation format
(with support for time alignment). There is a useful Java library with
an API (for reading data, running queries, etc.), and there are
components that let you put together GUI-based annotation tools.
Documentation is available, too (even though it takes a few clicks to
get to it).
The above link is at Edinburgh, but the Stuttgart people have
co-developed NXT, as have others.
David Reitter - ICCS/HCRC, Informatics, University of Edinburgh
Blog: http://www.davids-world.com Homepage:
On 28 Dec 2004, at 14:50, Grzegorz Chrupała wrote:
> Hi all,
> I am looking for an open-source library or program that implements
> running queries on annotated corpora. I'd mostly be working with
> multilingual parallelized POS-tagged corpora. I found software (Corpus
> Query Processor from IMS,
> http://www.ims.uni-stuttgart.de/projekte/CorpusWorkbench/) that seems
> like what I need, but it is not open-source.
> Do you know of an OSS project that does something like that?