[Corpora-List] Arabic Corpora

Miloš Jakubíček milos.jakubicek at sketchengine.co.uk
Fri Feb 2 12:43:05 CET 2018


Dear Alia,

you could make use of the arTenTen12 (7 billion tokens) that we have in Sketch Engine -- drop me an e-mail if you'd be interested. Moreover, we have already built word embeddings from this corpus if that is what you are looking for, have a look at https://embeddings.sketchengine.co.uk (where you can find more languages, I will send a separate mail to this list about that.)

Best Milos

Milos Jakubicek

CEO, Lexical Computing Brno, CZ | Brighton UK http://www.lexicalcomputing.com http://www.sketchengine.co.uk

On 31 January 2018 at 20:02, Alia Bahanshal <a.bahanshal at gmail.com> wrote:


> Hello,
>
> Is there any open source Arabic corpora I can use for deep learning
> research purposes?
>
> Thanks
>
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> https://mailman.uib.no/listinfo/corpora
>
>
-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 2020 bytes Desc: not available URL: <https://www.uib.no/mailman/public/corpora/attachments/20180202/6f08d13b/attachment.txt>



More information about the Corpora mailing list