[Corpora-List] DETEction and classification of racial STereotypes in Spanish (DETESTS) at IberLEF 2022

Wolfgang Sebastian Schmeisser Nieto wolfgang.schmeisser at ub.edu
Tue Mar 22 18:17:56 CET 2022

Please consider contributing and/or forwarding to appropriate colleagues and groups.

*****We apologize for the multiple copies of this e-mail*****


IberLEF 2022 Task: DETESTS (DETEction and classification of racial Stereotypes in Spanish)

This task will take part of IberLEF 2022<https://sites.google.com/view/iberlef2022>, the 4th Workshop on Iberian Languages Evaluation Forum at the SEPLN 2022 Conference, which will be held in A Coruña, Spain, on September 20th.

The aim of the task is to detect and classify stereotypes in sentences from comments posted in Spanish in response to different online news articles related to immigration. The task is designed in a hierarchical fashion by chaining two subtasks:

* ​Subtask 1: Participants tackling this problem will have to determine whether a sentence contains at least one stereotype or none considering the full distribution of labels provided by the annotators based on the proposal of learning with disagreements. The actual gold label of this subtask is left as a proxy to determine the subset of sentences that will be evaluated in the posterior subtask.

* Subtask 2: This subtask consists of determining whether a sentence contains at least one stereotype or none and assigning those sentences previously marked as positive (with stereotypes) to ten categories. Since a sentence can contain multiple stereotypes belonging to different categories, this subtask will be presented as a multi-label hierarchical classification problem.

Participants are allowed to participate just in one of them (e.g., subtask 1). Teams will be allowed (and encouraged) to submit multiple runs (max. 5).

The present task is proposed to participants interested in racial, national, or ethnic stereotype detection and classification tasks. Furthermore, the annotated dataset is a valuable resource for exploratory linguistic analysis, as well as for comparing the application of deep learning and classical machine learning models on Spanish stereotyped expressions under the recently introduced learning with disagreements paradigm. Participants will be provided with the annotated data by each of the annotators and the gold standard.

Linguistic resources:

Our DETESTS corpus consis made up of comments (at least 50) segmented onto sentences published in response to manually selected articles extracted from Spanish online newspapers. The common topic of all articles is immigration. It consists of 5,629 sentences. We will provide participants with 70% of the dataset to train their models, while the remaining 30% will be used to test their models.

To avoid any conflict with the sources of the comments regarding their intellectual property rights (IPR), the data will be sent privately to each participant who is interested in the task. The corpus will only be made available for research purposes.

Important dates (All deadlines are 11:59 PM UTC-12:00):

Training dataset release: March 21, 2022 (Already available at the website)

Test dataset release: April 20, 2022

Systems results: May 16, 2022

Results notification: May 23, 2022

Working papers submission: June 9, 2022

Working papers (peer-)reviewed: June 20, 2022

Camera-ready versions: July 4, 2022

Workshop at IberLEF 2022: September 20, 2022

Task organizers:

* Mariona Taulé (Universitat de Barcelona, UB)

* Wolfgang Schmeisser (Universitat de Barcelona, UB)

* Alejandro Ariza (Universitat de Barcelona, UB)

* Montserrat Nofre (Universitat de Barcelona, UB)

* Enrique Amigó (Universidad Nacional de Educación a Distancia, UNED)

* Paolo Rosso (Universitat Politècnica de València, UPV)

* Berta Chulvi (Universitat Politècnica de València, UPV)


Join our Google Groups<https://groups.google.com/g/detests-iberlef-2022> to be kept up to date with the latest news related to the task or write us to detests.iberlef at gmail.com<mailto:detests.iberlef at gmail.com>.

For more information, please visit our website detestsiberlef.wixsite.com/detests<https://detestsiberlef.wixsite.com/detests>.


European project 'STERHEOTYPES-Studying European Racial Hoaxes and Sterheotypes' funded by 'Challenge for Europe' call for Project, Compagnia San Paolo (CUP: B99C20000640007).

Grant XAI-DisInfodemics: IA explicable para desinformación y detección de conspiración durante infodemias (PLEC2021-007681) funded by MCIN/AEI/10.13039/501100011033 and, as appropriate, by the “European Union NextGenerationEU/PRTR”.

Aquest missatge, i els fitxers adjunts que hi pugui haver, pot contenir informació confidencial o protegida legalment i s’adreça exclusivament a la persona o entitat destinatària. Si no consteu com a destinatari final o no teniu l’encàrrec de rebre’l, no esteu autoritzat a llegir-lo, retenir-lo, modificar-lo, distribuir-lo, copiar-lo ni a revelar-ne el contingut. Si l’heu rebut per error, informeu-ne el remitent i elimineu del sistema tant el missatge com els fitxers adjunts que hi pugui haver.

Este mensaje, y los ficheros adjuntos que pueda incluir, puede contener información confidencial o legalmente protegida y está exclusivamente dirigido a la persona o entidad destinataria. Si usted no consta como destinatario final ni es la persona encargada de recibirlo, no está autorizado a leerlo, retenerlo, modificarlo, distribuirlo o copiarlo, ni a revelar su contenido. Si lo ha recibido por error, informe de ello al remitente y elimine del sistema tanto el mensaje como los ficheros adjuntos que pueda contener.

This email message and any attachments it carries may contain confidential or legally protected material and are intended solely for the individual or organization to whom they are addressed. If you are not the intended recipient of this message or the person responsible for processing it, then you are not authorized to read, save, modify, send, copy or disclose any part of it. If you have received the message by mistake, please inform the sender of this and eliminate the message and any attachments it carries from your account. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 37678 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20220322/0b0ac904/attachment.txt>

More information about the Corpora mailing list