[Corpora-List] Urdu-Hindi Transliteration corpora

Nick Ruiz nruiz at interactions.com
Mon Mar 19 19:26:35 CET 2018


Thanks, Alex! Nadir reached out to me to discuss transliteration mining. I might go that route or try to crowdsource a small corpus, if no other resources pop up. Thanks to everyone else who has graciously replied to me with ideas as well.

Best, Nick

On Mon, Mar 19, 2018 at 2:24 PM, Alexander Fraser < fraser at cis.uni-muenchen.de> wrote:


> Hi Nick,
>
> Maybe relevant, maybe not:
>
> Nadir Durrani, Hassan Sajjad, Alexander Fraser, Helmut Schmid (2010). Hindi-to-Urdu
> Machine Translation Through Transliteration
> <http://www.cis.uni-muenchen.de/~fraser/pubs/durrani_acl2010.pdf>. In
> Proceedings of the 48th Annual Meeting of the Association for Computational
> Linguistics (ACL), pages 465-474, Uppsala, Sweden, July.
>
> Cheers, Alex
>
>
> On Sat, Mar 17, 2018 at 7:59 AM, Nick Ruiz <nruiz at interactions.com> wrote:
>
>> Hi all,
>>
>> Can you help me identify any Urdu-Hindi parallel transliteration corpora
>> that are available on the web? By transliteration, I mean strictly the
>> conversion of writing systems, not translation. Thanks in advance!
>>
>> Kind regards,
>>
>> Nicholas Ruiz
>> Interactions Labs
>>
>> ************************************************************
>> *******************
>>
>> This e-mail and any of its attachments may contain Interactions LLC
>> proprietary information, which is privileged, confidential, or subject to
>> copyright belonging to the Interactions LLC. This e-mail is intended solely
>> for the use of the individual or entity to which it is addressed. If you
>> are not the intended recipient of this e-mail, you are hereby notified that
>> any dissemination, distribution, copying, or action taken in relation to
>> the contents of and attachments to this e-mail is strictly prohibited and
>> may be unlawful. If you have received this e-mail in error, please notify
>> the sender immediately and permanently delete the original and any copy of
>> this e-mail and any printout. Thank You.
>>
>> ************************************************************
>> *******************
>>
>> _______________________________________________
>> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
>> Corpora mailing list
>> Corpora at uib.no
>> https://mailman.uib.no/listinfo/corpora
>>
>>
>

--

*******************************************************************************

This e-mail and any of its attachments may contain Interactions LLC proprietary information, which is privileged, confidential, or subject to copyright belonging to the Interactions LLC. This e-mail is intended solely for the use of the individual or entity to which it is addressed. If you are not the intended recipient of this e-mail, you are hereby notified that any dissemination, distribution, copying, or action taken in relation to the contents of and attachments to this e-mail is strictly prohibited and may be unlawful. If you have received this e-mail in error, please notify the sender immediately and permanently delete the original and any copy of this e-mail and any printout. Thank You.

*******************************************************************************

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 5461 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20180319/0e58150a/attachment.txt>



More information about the Corpora mailing list