[Corpora-List] Crowdsourcing for arabic tasks

Mahmoud EL-Haj dr.melhaj at gmail.com
Fri Aug 7 16:38:40 CEST 2015

Dear Imen,

We have done some work on using AMT/CrowdFlower to summarise Arabic documents [1], and on using experts to build the Arabic resources needed for the MultiLing 2011/2013 tasks [2],[3]. In the meantime, we are working on hiring native Arabic-speaking experts to build an Arabic lexicon.

You should be able to run your tasks on AMT, but you need to be careful when evaluating the quality and consistency of the results.
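One common way to check consistency is to collect several judgments per item, keep the majority label, and measure agreement between annotators. The following is a minimal sketch under those assumptions, not necessarily the procedure used in [1]; the function names and the "pos"/"neg" labels are hypothetical and chosen only for illustration.

```python
from collections import Counter


def majority_label(labels):
    """Return (label, share) for the most frequent label among one item's judgments."""
    counts = Counter(labels)
    label, n = counts.most_common(1)[0]
    return label, n / len(labels)


def cohen_kappa(a, b):
    """Cohen's kappa for two annotators who labelled the same items in the same order."""
    if len(a) != len(b) or not a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(a)
    # Share of items on which the two annotators gave the same label.
    observed = sum(x == y for x, y in zip(a, b)) / n
    # Agreement expected by chance, from each annotator's label frequencies.
    ca, cb = Counter(a), Counter(b)
    expected = sum(ca[k] * cb[k] for k in ca) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used one and the same label throughout
    return (observed - expected) / (1 - expected)


# Hypothetical sentiment judgments from two workers on four documents.
worker_a = ["pos", "pos", "neg", "neg"]
worker_b = ["pos", "neg", "neg", "neg"]
print(cohen_kappa(worker_a, worker_b))         # 0.5
print(majority_label(["pos", "pos", "neg"]))   # label "pos" with a 2/3 share
```

Items with a low majority share, or worker pairs with a low kappa, are candidates for re-annotation or expert review.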


[1] Using Mechanical Turk to Create a Corpus of Arabic Summaries


[2] TAC 2011 MultiLing Pilot Overview


[3] Multi-document multilingual summarization corpus preparation, Part 1: Arabic, English, Greek, Chinese, Romanian


Best, Mahmoud


Dr Mahmoud El-Haj

Senior Research Associate

School of Computing and Communications

Lancaster University

http://www.lancaster.ac.uk/staff/elhaj/

m.el-haj at lancaster.ac.uk

From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of imen touati
Sent: Wednesday, August 05, 2015 9:10 PM
To: corpora at uib.no
Subject: [Corpora-List] Crowdsourcing for arabic tasks

Dear all,

Does anyone know of a group, like Amazon Mechanical Turk, that does crowdsourcing for tasks in Arabic and employs Arabic language experts? More precisely, the task is the annotation of Arabic documents for opinion mining and sentiment analysis.

Thank you in advance.


Imen Touati

PhD Computer Science student

Faculty of Economic Sciences and Management of Sfax

ANLP Research Group

http://sites.google.com/site/anlprg

MIRACL Laboratory

http://www.miracl.rnu.tn/

Address: FSEGS, BP 1088, 3018 Sfax, Tunisia

Email: ismi_touati at yahoo.fr
