[Corpora-List] Indonesian corpora

Иван Неткачев netkachev.hum at gmail.com
Thu Nov 30 19:50:00 CET 2017


Dear colleagues, I am currently trying to find an Indonesian corpus that would be appropriate for the research which I plan to undertake. I will need to search for particular parts of speech (classifier + noun), so a part of speech tagged corpus would be very useful. If you are aware of such a corpus, could you please share it with me?

Corpora I'm aware of so far: Jakarta Field Station <https://corpus1.mpi.nl/ds/asv/?0&openhandle=hdl:1839/00-0000-0000-0021-10DE-A> Indonesian manually tagged corpus <https://github.com/famrashel/idn-tagged-corpus> (which is too small to suit my goals) Sketch Engine Indonesian corpus SEAlang Indonesian corpus <http://sealang.net/indonesia/corpus.htm>

Thanks, Ivan Netkachev NRU HSE, Moscow -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 915 bytes Desc: not available URL: <https://www.uib.no/mailman/public/corpora/attachments/20171130/5f41eb4d/attachment.txt>



More information about the Corpora mailing list