[Corpora-List] machine translation

Megerdoomian, Karine karine at mitre.org
Tue Dec 18 15:33:02 CET 2012


I haven't seen any other parallel English-Persian corpora besides the ones already mentioned below. However, I have heard about a corpus being developed by the English department at Payame Noor University in Yazd, Iran. You may want to contact them. Here's the info online: http://www.eurac.edu/it/newsevents/focus/Newsdetails.html?entryid=22181

"Our developmental English-Persian parallel corpus consists of about three million words (more than 50,000 corresponding sentences in two languages). This is a kind of ongoing corpus, that is, an open corpus in which more material can be added as the need arises."

Karine

From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of Hieu Hoang Sent: Tuesday, December 18, 2012 7:31 AM To: Khamesi Fahime Cc: corpora at uib.no Subject: Re: [Corpora-List] machine translation

Hi Khamesi

According to this website

http://opus.lingfil.uu.se/ There are 3 freely available parallel corpora for persian-english:

TEP

KDE

OpenSubtitles

I've noticed other people, especially in Tehran, are also working on MT and collect data, eg.

http://ece.ut.ac.ir/iis/resources.html

Kind Regards Hieu

On 12 December 2012 21:15, Khamesi Fahime <khamesi_fahime at yahoo.com<mailto:khamesi_fahime at yahoo.com>> wrote: Hi, I am student of Linguistics in Iran and i am working on English to Persian statistical machine translation . unfortunately I haven't found any EN-PER corpus except TEP and ELRA . There are many restrictions in Iran(boycott) for ordering ELRA . I appreciate if u can help me in this respect. I am looking forward to your reply. Best regards, Khamesi

_______________________________________________ UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora Corpora mailing list Corpora at uib.no<mailto:Corpora at uib.no> http://mailman.uib.no/listinfo/corpora

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 7009 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20121218/289b6db6/attachment.txt>



More information about the Corpora mailing list