[Corpora-List] Shared Task on Slav-NER: Recognition, normalization, classificationand cross-lingual linking of Named Entities in Slavic languages

Roman Yangarber Roman.Yangarber at cs.helsinki.fi
Mon Dec 21 15:22:57 CET 2020


Call for Participation

3^rd Shared Task on Slav-NER:

*Recognition, Normalization, Classification and Cross-Lingual Linking of Named Entities in Slavic Languages* <http://bsnlp.cs.helsinki.fi/shared-task.html>

co-located with the BSNLP Workshop at EACL 2021

http://bsnlp.cs.helsinki.fi/

**

*SHARED TASK**DESCRIPTION*:

The 3^rd  edition of the Slav-NER Shared Task focuses on the analysis of Named Entities in multilingual Web documents in Slavic languages.

Due to rich inflection, free word order, derivation, and other phenomena present in the the Slavic languages, work on Named Entities poses a challenging task. Fostering research & development on the problems of Named Entities — detecting mentions of names, lemmatization (normalization), classification, and cross-lingual matching — is crucial for cross-lingual information access and wider use of NLP in Slavic languages.

The 3^rd  edition of the Shared Task covers six languages:

* Bulgarian,

* Czech,

* Polish,

* Russian,

* Slovene,

* Ukrainian.

and five types of named entities:

* persons,

* locations,

* organizations,

* events,

* products.

For information about training and test data, guidelines, registration and participation, *please* see the *Shared Task Home Page. <http://bsnlp.cs.helsinki.fi/shared-task.html>*

The Shared Task focuses on cross-lingual, document-level extraction of named entities — the systems should recognize, classify, and extract all named entity mentions in a document; detecting the /position/ of each named entity mention is not required. Named-entity mentions should be /lemmatized/, and mentions referring to the same real-world object should be linked across documents and languages. The input text collection consists of sets of documents retrieved from the Web, each set being about a certain entity or event. The corpus was obtained by crawling the Web and parsing the HTML of documents.

IMPORTANT: *it is NOT mandatory to participate in the full task*, e.g., monolingual responses, without lemmatization of the extracted named entities, can be evaluated also.

See the details about the 1^st  edition (2017)  <http://bsnlp-2017.cs.helsinki.fi/shared_task.html>and the 2^nd  edition (2019) <http://bsnlp.cs.helsinki.fi/bsnlp-2019/shared_task.html> of this shared task.

Participation

Teams that intend to participate should register by sending an email to: bsnlp at cs.helsinki.fi <mailto:bsnlp at cs.helsinki.fi>, which includes the following information:

* name of team,

* names of team members,

* contact person,

* contact email.

Important Dates

* Shared task announcement and call for participation: *1 December 2020*

* Release of (missing) training data: *21 December 2021*

* Registration deadline: *10 January 2021*

* Release of blind test data for registered participants: *25 January 2021*

* Submission of system responses: *27 January 2021*

* Sending results to participants: *28 January 2021*

* Shared task paper submission due (non-mandatory): *1 February 2021*

* Notification of acceptance: *18 February 2021*

* Camera-ready shared task papers due: *1 March*

--

Roman Yangarber Associate Professor, University of Helsinki INEQ: Helsinki Inequality Initiative — Linguistic Inequalities and Translation Technologies Digital Humanities ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

Computational linguistics www.cs.helsinki.fi/Roman.Yangarber Surveillance of news media puls.cs.helsinki.fi e-Learning & language learning revita.cs.helsinki.fi

Unioninkatu 40, Metsätalo B615 mobile: +358 50 41 51 71 3 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 31563 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20201221/b492817f/attachment.txt>



More information about the Corpora mailing list