*First Free English-Persian Parallel Corpus*
By Mohammad Taher Pilevar, NLP Lab, University of Tehran, Iran.
4 million tokens on each side Sentence Aligned Extracted from movie subtitles Text domain: informal/conversational Total alinged movie subtitles: 1600
http://ece.ut.ac.ir/NLP/resources.htm -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 555 bytes Desc: not available URL: <http://www.uib.no/mailman/public/corpora/attachments/20100414/1f253f51/attachment.txt>