[Corpora-List] Final CfP MEDDOPROF Shared Task on occupation detection - Test set released!

Salvador Lima salvador.limalopez at gmail.com
Wed Jun 2 10:56:48 CEST 2021

[apologies for cross-posting]


MEDDOPROF Shared Task (IberLEF - SEPLN 2021)

Medical Documents Profession Recognition shared task


MEDDOPROF Cup Awards by BSC-Plan TL [3,000€]

We are organizing the first task specifically focusing on the automatic recognition and normalization (entity linking) of professions from medical documents.

Systems resulting from MEDDOPROF could contribute in using NLP techniques to extract critical health information related to patient occupations, such as:

- Exposure to toxic substances

- Infectious pathogens

- Allergies

- Work accidents

- Mental health issues We are happy to announce that the *MEDDOPROF test set has been released* and the *evaluation period has officially started*. You may download it from: https://zenodo.org/record/4889777. Remember that you have *until June 9th* to submit your systems! The Gold Standard annotations will be released on June 12th, once the evaluation period is over and results have been sent out.

In addition to the practical relevance of the track and the workshop (talks and proceedings), we will also have awards sponsored by BSC/Plan TL

As MEDDOPROF covers automatic normalization or linking to standard international multilingual terminologies (ESCO, SNOMED CT), it can also inspire the development of resources for other application scenarios (human resources, competitive intelligence, social services,..). The used guidelines, annotations & multi-lingual terminologies could also potentially be adapted to process documents in other languages.

MEDDOPROF sub-tracks:

MEDDOPROF-NER: automatic detection of mentions of occupations (profession, employment status and activities).

MEDDOPROF-CLASS: finding mentions of occupations and classifying them, whether they refer to the patients themselves, their family members or to healthcare professionals.

MEDDOPROF-NORM: mapping detected occupation mentions to their corresponding concept identifiers from standard multilingual occupation terminologies (ESCO and SNOMED-CT).

Key information:

MEDDOPROF web: https://temu.bsc.es/meddoprof/

Registration: https://temu.bsc.es/meddoprof/registration

Training Data + Complementary Entities Dataset: https://doi.org/10.5281/zenodo.4694768

*Test Data:* https://doi.org/10.5281/zenodo.4889776:

*Annotation Guidelines*: https://doi.org/10.5281/zenodo.4694675


Test set release (start of evaluation period): June 1st, 2021

End of evaluation period (system submissions): June 9th, 2021

Working papers submission: June 21st, 2021

Notification of acceptance (peer-reviews): June 27th, 2021

Camera-ready system descriptions: July 4th, 2021

IberLEF @ SEPLN 2021: September 2021

Publications and IBERLEF/SEPLN2021 workshop

Teams participating in MEDDOPROF will be invited to contribute a systems description paper for the IberLEF (SEPLN 2021) Working Notes proceedings, and a short presentation of their approach at the IberLEF 2021 workshop. More information about format, template, ... on https://temu.bsc.es/meddoprof/publications/

Main Organizers


Martin Krallinger, Barcelona Supercomputing Center, Spain


Eulŕlia Farré, Barcelona Supercomputing Center, Spain


Salvador Lima, Barcelona Supercomputing Center, Spain


Vicent Briva-Iglesias, D-REAL, Dublin City University, Ireland


Antonio Miranda-Escalada, Barcelona Supercomputing Center, Spain

Scientific Committee


Sophia Ananadiou, Department of Computer Science, University of

Manchester, UK


Alec Chapman, Data Scientist, University of Utah


Dina Demner-Fushman, Tenure Track Investigator, Biomedical Informatics

Branch, Lister Hill National Center for Biomedical Communications


Hercules Dalianis, Professor in Computer and Systems Science, Stockholm

University, Sweden


Hongfang Liu, Professor of Biomedical Informatics, Mayo Clinic


Josep Maria Haro Abad, Institut de Recerca Sant Joan de Déu


Bradley Malin, Accenture Professor of Biomedical Informatics,

Biostatistics, and Computer Science, Vanderbilt


Goran Nenadic, Department of Computer Science, University of Manchester,



Aurélie Névéol, LIMSI-CNRS, Université Paris-Sud, France


Řystein Nytrř, Department of Computer and Information Science, Norges

Teknisk-Naturvitenskapelige Universitet (NTNU)


Carlos Luis Parra Calderón, Head of Technological Innovation at Virgen

del Rocío University Hospital, Institute of Biomedicine of Seville, Spain


Kirk E. Roberts, School of Biomedical Informatics, University of Texas

Health Science Center


Francisco Javier Sanz Valero, Escuela Nacional de Medicina del Trabajo,

Instituto de Salud Carlos III, Spain


Stefan Schulz, Institute for Medical Informatics, Statistics and

Documentation, Medical University of Graz, Austria


Ashish Tendulkar, Machine Learning Specialist at Google


Michelle Turner, Assistant Research Professor at Barcelona Institute for

Global Health, Secretary-Treasurer International Society for Environmental

Epidemiology (ISEE)


Ozlem Uzuner, George Mason University


Alfonso Valencia Herrera, Barcelona Supercomputing Center (BSC-CNS),

Spain -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 37862 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20210602/d32a824d/attachment.txt>

More information about the Corpora mailing list