[Corpora-List] Most frequent 5K words in Icelandic?

Tristan Miller miller at ukp.informatik.tu-darmstadt.de
Mon Nov 19 11:13:01 CET 2012


Greetings.

On 19/11/12 10:58 AM, Kim Witten wrote:
> Hi Corpora Subscribers,
> I'm wondering if somebody might be able to point me in the direction to find a simple list of the 5,000 most frequent words in Icelandic, from any (relatively current, non-historical) Icelandic corpus? With English gloss would be even better, but it's not necessary. Thanks!

Wiktionary has a 5K list derived from movie and television subtitles: http://en.wiktionary.org/wiki/Wiktionary:Frequency_lists/Icelandic_wordlist

It is most likely a truncated version of the lists at http://invokeit.wordpress.com/frequency-word-lists/ which include 50K and even longer versions.
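
If you want a different cutoff than 5K, truncating one of the longer lists yourself is straightforward. Below is a minimal Python sketch, assuming each line of the list holds a word and its count separated by whitespace; I have not checked that the files use exactly this layout, so adjust the parsing if they differ. The function name top_n_words and the sample counts are purely illustrative placeholders, not real corpus figures.

```python
from io import StringIO


def top_n_words(lines, n=5000):
    """Return the n most frequent (word, count) pairs.

    Assumes each line looks like "<word> <count>"; this layout is an
    assumption, not something confirmed for any particular list.
    """
    entries = []
    for line in lines:
        parts = line.split()
        if len(parts) != 2 or not parts[1].isdigit():
            continue  # skip blank or malformed lines
        entries.append((parts[0], int(parts[1])))
    # Sort by descending count in case the input is not already ordered.
    entries.sort(key=lambda pair: pair[1], reverse=True)
    return entries[:n]


# Toy input with made-up counts, standing in for a real frequency file.
sample = StringIO("að 100\nog 90\ní 80\ner 70\n")
top = top_n_words(sample, n=2)
for word, count in top:
    print(word, count)
```

For a real file, you would pass an open file handle (opened with encoding="utf-8") in place of the StringIO sample.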

Regards, Tristan

-- 
Tristan Miller, Doctoral Researcher
Ubiquitous Knowledge Processing Lab (UKP-TUDA)
Department of Computer Science, Technische Universität Darmstadt
Tel: +49 6151 16 6166 | Web: http://www.ukp.tu-darmstadt.de/



