[Corpora-List] List Parallel Corpora with Cronological data

Cam Fordyce camfordyce at gmail.com
Tue Aug 12 09:58:57 CEST 2008


There are starting points for finding existing parallel corpora that I know of.

The FP6 Euromatrix MT translation project has a matrix of language resources for all of the European Union Languages including parallel corpora. See http://www.euromatrix.net/euromatrix

The JRC-Acquis Multilingual Parallel Corpus which is available at http://langtech.jrc.it/JRC-Acquis.html contains parallel texts for 22 EU languages.

Finally, there is the EuroParl corpus which can be found the University of Edinburgh, at http://www.statmt.org/europarl/ .

For the dates of publication, you will need to check each url above.

Good luck.

Best regards,

Cam Fordyce

2008/7/14 bruno cavestro <cavestro.bruno at gmail.com>:
> Hello,
>
> I am looking for an almost exhaustive list of existing parallel corpora.
> + infos on the date of pubblication of each corpora
>
> Best Regards
>
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
>



More information about the Corpora mailing list