I am looking for a Malay part-of-speech tagger or a PoS-annotated corpus that could be used to train one. Any tips for a robust Malay stemmer/lemmatiser would also be appreciated.
I am interested in both open-source and commercial software/data.
Many thanks,
Viktor
==========================
Viktor Pekar
Chief Scientist/NLP developer
Market Sentinel Ltd. 6 Sancroft Street London, SE11 5UD UK
www.marketsentinel.com <http://www.marketsentinel.com/blog>
Company twitter: @marketsentinel
t: +44 (0) 20 7793 1575
========================= -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 690 bytes Desc: not available URL: <http://www.uib.no/mailman/public/corpora/attachments/20110314/2c352d1d/attachment.txt>