[Corpora-List] French corpora for POS tagger evaluation

DJamé Seddah djame.seddah at free.fr
Thu Feb 21 12:06:53 CET 2013


Le 15 févr. 2013 à 10:28, Olivier Austina a écrit :


> Hello,
>
> I am looking for a standard French corpora for POS tagger evaluation. Where
> can I download the corpus please. Thanks.
>
> --
> Regards
> Austina

Hi, you can also get the Sequoia Treebank (a freely available treebank for French, with an French Treebank based annotation scheme).

https://www.rocq.inria.fr/alpage-wiki/tiki-index.php?page=CorpusSequoia

Also, if you're interested on comparing with the state-of-the-art POS tagging of French,

I'd suggest you to sign a license for the French Treebank http://www.llf.cnrs.fr/Gens/Abeille/French-Treebank-fr.php ) and

contact Marie Candito (marie.candito at linguist.jussieu.fr) to get the data set that has been used the most in the literature.

(see http://aclweb.org/aclwiki/index.php?title=POS_Tagging_(State_of_the_art) )

Best, Djamé



More information about the Corpora mailing list