[Corpora-List] Tools for historical languages?

René Witte witte at ipd.uka.de
Thu Nov 20 18:16:44 CET 2008

Within the "Durm" project we worked with 19th-century German documents on architecture:


One of the tools we developed within this project is a self-learning lemmatizer for German nouns (open source) that can deal with old, specialized terminology:



On Wed November 19 2008 09:55, you wrote:
> Dear all,
> I'm looking for tools for the analysis of historical languages, e.g.
> sentence splitters, part-of-speech taggers, or spelling normalisers. I am
> working on German texts (diplomatic transcriptions) from the 11th-16th
> centuries, but I'd be interested in tools for any historical language, and
> tools for languages that lack a standardised spelling such as dialects.
> Thank you for any help,
> Stefanie
> --
> Jun.-Prof. Dr. Stefanie Dipper
> Sprachwiss. Institut, Ruhr-Universitaet Bochum
> D - 44780 Bochum, Germany
> http://www.linguistics.ruhr-uni-bochum.de/~dipper
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora

More information about the Corpora mailing list