[Corpora-List] Downloadable Finnish corpora?

Christian Chiarcos christian.chiarcos at web.de
Tue Nov 29 09:17:57 CET 2011


Dear Laura,

you might want to take a look on the OPUS corpora (http://opus.lingfil.uu.se) which include Europarl, technical documentation, etc. Further, there is a biomedical corpus under http://mars.cs.utu.fi/PPICorpora. And of course you can build your own web corpus of Finnish with toolkits like Bootcat (http://bootcat.sslmit.unibo.it). Finally, Project Gutenberg provides 631 titles in Finnish from which a corpus can be built easily (http://www.gutenberg.org/browse/languages/fi).

Best, Christian

On Mon, 28 Nov 2011 10:50:13 +0100, Laura Lofberg <Laura.Lofberg at uta.fi> wrote:


> Hi,
>
> Does anyone know of any free downloadable Finnish corpora other than
> those mentioned in the Kotus website? I am interested in both written
> and spoken language.
>
> Thanks very much!
>
> Best,
>
> Laura Löfberg
> School of Language, Translation and Literary Studies
> University of Tampere
> Finland



More information about the Corpora mailing list