[Corpora-List] Looking for Polish news corpus

Lejeune Gal gael.lejeune at unicaen.fr
Mon Apr 24 22:41:23 CEST 2017


Dear janne,

The daniel corpus contains around 450 manually cleaned documents in Polish. You can download it from researchgate : *http://tinyurl.com/daniel-corpus*

Maybe you need a much bigger corpus, If so please send me an email. I have another dataset which is not manually cleaned and not yet available online.

Best regards, Gal

Le 24/04/2017 13:41, Janne Bondi Johannessen a crit :
> Dear colleagues.
>
> Does any of you now of a substantial and downloadable Polish corpus?
> We need it for a project on distributional semantics.
>
> Best wishes,
> Janne Bondi Johannessen
>
> --
> Janne Bondi Johannessen
> <http://www.hf.uio.no/multiling/english/people/core-group/jannebj/index.html>
> Professor, University of Oslo & editor of Norsk Lingvistisk Tidsskrift
> The Text Laboratory, ILN &
> Center for Multilingualism in Society across the Lifespan
> P.O.Box 1102 Blindern, 0317 Oslo, Norway
> Tel: +47 22 85 68 14, mob.: +47 928 966 34, e-mail: jannebj at iln.uio.no
> <mailto:jannebj at iln.uio.no>
>
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 4746 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20170424/dd6faf5d/attachment.txt>



More information about the Corpora mailing list