[Corpora-List] Corpus of tagged Romanian

Ruprecht von Waldenfels rvwfels at gmx.de
Thu Sep 10 18:50:44 CEST 2015


Hi,

MULTEXT-East includes Orwell's 1984 in Romanian with manually checked tagging. We used this to train a tagger for Romanian; see http://cui.unige.ch/~gesmundo/

Best,
Ruprecht

On 10.09.2015 at 03:00, corpora-request at uib.no wrote:
> Subject: [Corpora-List] Corpus of tagged Romanian
> To:CORPORA at uib.no
>
> Dear all,
>
> I am searching for a tagged corpus of Romanian data to be used to train a
> part of speech tagger. Should you know of resources such as this, please
> let me know.
>
> Shane
