[Corpora-List] quantities of publicly available parallel text?

Chris Dyer redpony at umd.edu
Wed Feb 27 03:50:15 CET 2008


Dear colleagues,

Is anyone aware of attempts to estimate how much machine-readable parallel text is publicly available? I'm trying to get a general sense of the scale of parallel data we currently have (and are likely to have in the future, assuming current growth trends). Does anyone have any statistics on this sort of thing?

Many thanks,
Chris

------------------------
Chris Dyer
Dept. of Linguistics
University of Maryland


