[Corpora-List] First Free English-Persian Parallel Corpus

Taher Pilevar taher.pilevar at gmail.com
Wed Apr 14 09:10:16 CEST 2010


Please send this message to the list for the researches who are looking for English-Persian corpora:

*First Free English-Persian Parallel Corpus*

By Mohammad Taher Pilevar, NLP Lab, University of Tehran, Iran.

4 million tokens on each side Sentence Aligned Extracted from movie subtitles Text domain: informal/conversational Total alinged movie subtitles: 1600

http://ece.ut.ac.ir/NLP/resources.htm -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 555 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20100414/1f253f51/attachment.txt>



More information about the Corpora mailing list