[Corpora-List] TRAINING CORPUS RELEASED BARR2 track - IberEval 2018

Montserrat Marimon montserrat.marimon at gmail.com
Thu May 17 15:39:15 CEST 2018


*** TRAINING CORPUS RELEASED BARR2 track - IberEval 2018 ***

Second Biomedical Abbreviation Recognition and Resolution track (BARR2)

BARR2 track workshop at SEPLN2018, September 18, Seville, Spain

http://temu.bsc.es/BARR2/

Overview

Finding and resolving abbreviations and symbols is a critical task for information retrieval, text classification, named entity recognition and even machine translation systems. Moreover, approaches to recognizing and resolving abbreviations can often be adapted directly across languages, resulting in resources of both widespread use and high impact. Nevertheless, because exhaustively manually annotated abbreviation resolution corpora are lacking, in particular for certain key domains, evaluating and improving abbreviation resolution systems remains an active field of research.

In the case of biomedical clinical texts, abbreviations are particularly frequent and often refer to important entities and concepts such as diseases, treatments, symptoms, drugs or biomolecular entities. Clinical NLP systems therefore require correct abbreviation recognition and resolution.

Building on the success of the first BARR track, posed at IberEval 2017, which covered the detection of relations between short forms and long forms in the medical literature, the second BARR track requires the detection of abbreviations in clinical case reports written in Spanish. With an estimated number of more than 500 million Spanish speakers worldwide, recognizing and resolving abbreviations in Spanish clinical texts is an important task for Spanish clinical NLP tools.

The BARR2 track will be divided into two sub-tracks, namely:

· Sub-track 1: participating teams are asked to provide systems that detect only explicit occurrences of abbreviation-definition pairs

· Sub-track 2: participating teams are asked to resolve short forms regardless of whether their definitions are mentioned within the actual document
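
To illustrate what an explicit abbreviation-definition pair looks like, here is a minimal Python sketch. It is not the organizers' method or a baseline for the track: it assumes the simplest pattern only, a long form immediately followed by an uppercase short form in parentheses, whose letters match the initials of the preceding words. The function name and the example sentence are hypothetical.

```python
import re
from typing import List, Tuple

# Assumption: a short form is 2 to 6 uppercase letters inside parentheses, e.g. "(TAC)".
SHORT_FORM = re.compile(r"\(([A-ZÁÉÍÓÚÑ]{2,6})\)")


def find_explicit_pairs(text: str) -> List[Tuple[str, str]]:
    """Return (short form, long form) pairs found by a naive initial-letter match."""
    pairs = []
    for match in SHORT_FORM.finditer(text):
        short = match.group(1)
        # Words that appear before the opening parenthesis.
        words = re.findall(r"\w+", text[:match.start()])
        # Candidate long form: as many preceding words as the short form has letters.
        candidate = words[-len(short):]
        if len(candidate) < len(short):
            continue
        initials = "".join(word[0] for word in candidate).upper()
        if initials == short:
            pairs.append((short, " ".join(candidate)))
    return pairs


example = "Se realizó una tomografía axial computarizada (TAC) urgente."
print(find_explicit_pairs(example))
# prints [('TAC', 'tomografía axial computarizada')]
```

This heuristic misses many real cases, for example long forms containing function words such as "de", or short forms built from syllables rather than initials (HTA for "hipertensión arterial"). Sub-track 2 is out of its reach entirely, since there the definition need not appear in the document.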

The primary evaluation metrics used for the BARR2 track will be precision, recall and F-score of the predictions against a manual gold standard.
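
As a rough illustration of how precision, recall and F-score relate, the following Python sketch scores a set of predicted pairs against a gold set using exact matching. The record layout (document id, short form, long form), the exact-match criterion and the balanced F1 variant are assumptions made for this example; the official evaluation format and matching rules are those published on the track page.

```python
from typing import Dict, Iterable, Tuple

# Hypothetical record layout: (document_id, short_form, long_form).
Pair = Tuple[str, str, str]


def precision_recall_f1(predicted: Iterable[Pair], gold: Iterable[Pair]) -> Dict[str, float]:
    """Score predictions against a gold standard using exact set matching."""
    pred_set = set(predicted)
    gold_set = set(gold)
    true_positives = len(pred_set & gold_set)
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    denominator = precision + recall
    # Balanced F-score (F1): harmonic mean of precision and recall.
    f1 = 2 * precision * recall / denominator if denominator else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Invented toy data, not taken from the BARR2 corpus.
gold = [
    ("doc1", "TAC", "tomografía axial computarizada"),
    ("doc1", "IAM", "infarto agudo de miocardio"),
    ("doc2", "HTA", "hipertensión arterial"),
]
predicted = [
    ("doc1", "TAC", "tomografía axial computarizada"),
    ("doc2", "HTA", "hipertensión arterial"),
    ("doc2", "DM", "diabetes mellitus"),
]

scores = precision_recall_f1(predicted, gold)
print(scores)
```

In this toy example two of the three predictions are correct and two of the three gold pairs are found, so precision, recall and F1 all equal 2/3.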

Additional details, sample sets, FAQ and registration details can be found at the BARR2 track URL: http://temu.bsc.es/BARR2

Tentative track dates

- 20th April 2018: Sample data
- 16th May 2018: Training corpus available
- 23rd May 2018: Development corpus available
- 25th May 2018: Test corpus available
- 10th June 2018: Submission of team predictions
- 13th June 2018: Publication of results
- 20th June 2018: Working notes submission
- 23rd June 2018: Release of the working notes reviews
- 2nd July 2018: Camera ready paper submission
- 18th September 2018: Workshop IberEval 2018

Organizers

- Martin Krallinger <krallinger.martin at gmail.com>, Biological Text Mining Unit (Bio-TeMUC), CNIO, Spain
- Alfonso Valencia <alfonso.valencia at bsc.es>, Life Sciences Department Director, BSC, Spain
- Nuria Bel <nuria.bel at upf.edu>, Department of Translation and Language Sciences, UPF, Spain
- Ander Intxaurrondo <ander.intxaurrondo at bsc.es>, Biological Text Mining Unit (Bio-TeMUC), CNIO, Spain
- Aitor Gonzalez-Agirre <aitor.gonzalezagirre at bsc.es>, Biological Text Mining Unit (Bio-TeMUC), CNIO, Spain
- Marta Villegas <marta.villegas at bsc.es>, Barcelona Supercomputing Center (BIO-TeMUC), CNIO, Spain
- Jose A. Lopez Martin <jalopezmartin at gmail.com>, Medical Oncology, Hospital 12 de Octubre, Spain
- Montserrat Marimon <montserrat.marimon at bsc.es>, Barcelona Supercomputing Center (Bio-TeMUC), BSC, Spain

Scientific Advisory Board

- Saber Ahmad Akhondi <s.akhondi at elsevier.com>, Principal NLP Scientist, Elsevier Content & Innovation
- Sophia Ananiadou <Sophia.Ananiadou at manchester.ac.uk>, Professor of the School of Computer Science, University of Manchester / Director of the National Centre for Text Mining (NaCTeM), UK
- Marius Doornenbal <M.Doornenbal at elsevier.com>, Chief NLP Scientist, Elsevier Content & Innovation
- Fernando A. Navarro <fernando.a.navarro at telefonica.net>, MD, Cosnauta, Siglas médicas en español; Founding member of TREMÉDICA, Spain
- Carlos Luis Parra Calderón <carlos.parra.sspa at juntadeandalucia.es>, Head of Technological Innovation Section, University Hospital Virgen del Rocio; Director of biomedical informatics, Instituto de Biomedicina de Sevilla, Spain
- Hua Xu <Hua.Xu at uth.tmc.edu>, Director of the Center for Computational Biomedicine, University of Texas Health Science Center at Houston, USA

--

Montserrat Marimon.-
montserrat.marimon at gmail.com
Barcelona Supercomputing Center – Centro Nacional de Supercomputación,
Life Sciences Department,
Carrer de Jordi Girona, 29-31,

Barcelona 08034, Spain
tel. +34 93 4137569


