[Corpora-List] Fangorn: a system for querying very large treebanks

Steven Bird sb at csse.unimelb.edu.au
Thu Aug 30 08:21:47 CEST 2012


Fangorn is an open source tool for querying very large treebanks, built on top of Apache Lucene. Fangorn implements the LPath linguistic path language, which has an XPath-like syntax along with linguistically motivated extensions. Result trees are annotated with the query in order to show how the query matched the tree, and these annotations can themselves be modified and submitted as further queries.

Demonstration site:

http://nltk.ldc.upenn.edu:9090/

Query language tutorial:

https://code.google.com/p/fangorn/wiki/Query_Language

Source code:

http://code.google.com/p/fangorn/

Steven Bird and Sumukh Ghodke



More information about the Corpora mailing list