[Corpora-List] Test data ready + Evaluation start – SemEval 2020 Task 1: Unsupervised Lexical Semantic Change Detection

Dominik Schlechtweg dominik.schlechtweg at gmx.de
Thu Feb 20 16:27:13 CET 2020

- Apologies for cross-posting -

Dear colleagues,

we are happy to announce that the *test data* for the SemEval 2020 Task 1 on Unsupervised Lexical Semantic Change Detection has been released. This also means the evaluation period starts. The deadlines have also been updated:

Test data ready: February 19, 2020 Evaluation start: February 19, 2020 Evaluation end: March 11, 2020 Paper submission due: April 17, 2020 Notification to authors: June 10, 2020 Camera ready due: July 1, 2020

This year, the SemEval workshop will be co-located with COLING in Barcelona (Spain), and held on September 13-14.

----------------- *** SemEval 2020 Task 1: Unsupervised Lexical Semantic Change Detection *** Shared Task Website: https://languagechange.org/semeval Organisers’ email: semeval2020lexicalsemanticchange at turing.ac.uk Google Group: https://groups.google.com/forum/#!forum/semeval-2020-task-1

Lexical Semantic Change (LSC) detection, i.e. the task of identifying words that change meaning over time, is a very active research area, with applications in NLP, lexicography, and linguistics. Evaluation is currently the most pressing problem in LSC detection, as no gold standards are available to the community, which hinders progress. We organize a shared task that addresses this gap by providing researchers with an evaluation frameworkand high-quality manually annotated datasets for English, German, Latin, and Swedish.

Participants are asked to solve two related tasks for computational LSC detection, aiming to identify change in senses of words over time using corpus data.

* Sub-Tasks *

We provide two time-specific (t1, t2) corpora for each language. Both tasks will be evaluated using ground truth as annotated by humans, native speakers (except for Latin, for which scholars of Latin were used).

Sub-Task 1 - Binary classification: given a set of words, determine which gained or lost senses between t1 and t2 and which did not. Sub-Task 2 - Ranking: given a set of words, rank them by the degree of LSCthey underwent between t1 and t2.

Task Organizers

Barbara McGillivray, University of Cambridge and The Alan Turing Institute, UK Dominik Schlechtweg, University of Stuttgart, DE Simon Hengchen, University of Helsinki, FI Haim Dubossarsky, University of Cambridge, UK Nina Tahmasebi, University of Gothenburg, SE

More information about the Corpora mailing list