[Corpora-List] arabic transliteration

Motaz SAAD motaz.saad at inria.fr
Tue May 29 12:57:32 CEST 2012


Hello,

You can find an implementation of Arabic transliteration in Perl at http://search.cpan.org/dist/Encode-Arabic/lib/Encode/Arabic/Buckwalter.pm and in Python at https://github.com/guysoft/3arabi
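
For illustration, here is a minimal Python sketch of table-based Buckwalter transliteration. It is not the code of either package above. The names BUCKWALTER, PUNCTUATION and transliterate are hypothetical; the letter table follows the chart at http://www.qamus.org/transliteration.htm, while the punctuation entries are an assumed extension of the kind suggested in the quoted message below (Arabic comma to Latin comma).

```python
# Buckwalter chart: Arabic code point -> ASCII symbol.
BUCKWALTER = {
    "\u0621": "'",  # hamza
    "\u0622": "|",  # alef with madda
    "\u0623": ">",  # alef with hamza above
    "\u0624": "&",  # waw with hamza
    "\u0625": "<",  # alef with hamza below
    "\u0626": "}",  # yeh with hamza
    "\u0627": "A",  # alef
    "\u0628": "b",  # beh
    "\u0629": "p",  # teh marbuta
    "\u062A": "t",  # teh
    "\u062B": "v",  # theh
    "\u062C": "j",  # jeem
    "\u062D": "H",  # hah
    "\u062E": "x",  # khah
    "\u062F": "d",  # dal
    "\u0630": "*",  # thal
    "\u0631": "r",  # reh
    "\u0632": "z",  # zain
    "\u0633": "s",  # seen
    "\u0634": "$",  # sheen
    "\u0635": "S",  # sad
    "\u0636": "D",  # dad
    "\u0637": "T",  # tah
    "\u0638": "Z",  # zah
    "\u0639": "E",  # ain
    "\u063A": "g",  # ghain
    "\u0640": "_",  # tatweel
    "\u0641": "f",  # feh
    "\u0642": "q",  # qaf
    "\u0643": "k",  # kaf
    "\u0644": "l",  # lam
    "\u0645": "m",  # meem
    "\u0646": "n",  # noon
    "\u0647": "h",  # heh
    "\u0648": "w",  # waw
    "\u0649": "Y",  # alef maksura
    "\u064A": "y",  # yeh
    "\u064B": "F",  # fathatan
    "\u064C": "N",  # dammatan
    "\u064D": "K",  # kasratan
    "\u064E": "a",  # fatha
    "\u064F": "u",  # damma
    "\u0650": "i",  # kasra
    "\u0651": "~",  # shadda
    "\u0652": "o",  # sukun
    "\u0670": "`",  # superscript alef
    "\u0671": "{",  # alef wasla
}

# Assumed extension, NOT part of the Buckwalter chart:
# Arabic punctuation mapped to its Latin counterpart.
PUNCTUATION = {
    "\u060C": ",",  # Arabic comma
    "\u061B": ";",  # Arabic semicolon
    "\u061F": "?",  # Arabic question mark
}


def transliterate(text, keep_punctuation=True):
    """Map each character through the table; unknown characters pass through."""
    table = dict(BUCKWALTER)
    if keep_punctuation:
        table.update(PUNCTUATION)
    return "".join(table.get(ch, ch) for ch in text)


if __name__ == "__main__":
    # The word "kitab" (book) followed by an Arabic comma.
    print(transliterate("\u0643\u062A\u0627\u0628\u060C"))
```

Characters outside both tables, including any ASCII already present in the input, are copied through unchanged, so the output is only guaranteed to be pure ASCII if the input contains nothing but the mapped characters and ASCII.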

all the best, Motaz

----- Original Message -----


> From: "Imene Bensalem" <bens.imene at gmail.com>
> To: corpora at uib.no
> Sent: Saturday, May 26, 2012 2:53:21 PM
> Subject: Re: [Corpora-List] arabic transliteration


> Hi Amira,


> You could try the Buckwalter transliteration
> (http://www.qamus.org/transliteration.htm), but it does not cover
> punctuation. If your task needs to keep punctuation and you want
> to encode the transliterated text in ANSI or ASCII, you have to add
> transliterations of the Arabic punctuation marks to the Buckwalter
> table; for example, the Arabic comma should be transliterated as the
> Latin comma.


> All the best !


> Imene


> > On Sat, May 26, 2012 at 11:00 AM, <corpora-request at uib.no> wrote:

> > > ------------------------------

> > > Message: 8
> > > Date: Fri, 25 May 2012 21:07:02 +0100 (BST)
> > > From: Amira Barhoumi <amirabarhoumi29 at yahoo.fr>
> > > Subject: [Corpora-List] arabic transliteration
> > > To: "Corpora at uib.no" <Corpora at uib.no>

> > > Hello,

> > > I would like to know how to transliterate an Arabic sentence.
> > > Is there any norm?

> > > Thanks,
> > > Amira
> > > Master's student in computer science

> > > ------------------------------

> > > Message: 9
> > > Date: Sat, 26 May 2012 00:31:51 +0000
> > > From: Yassine Benajiba <benajibayassine at gmail.com>
> > > Subject: Re: [Corpora-List] arabic transliteration
> > > To: Amira Barhoumi <amirabarhoumi29 at yahoo.fr>
> > > Cc: "Corpora at uib.no" <Corpora at uib.no>

> > > Did you look at Buckwalter transliteration? It's not a norm, but I
> > > think most of the people doing Arabic NLP use it. Here is a link:
> > > http://www.qamus.org/transliteration.htm

> > > Good luck!
> > > --Yassine.




> > > End of Corpora Digest, Vol 59, Issue 31
> > > ***************************************


> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora