To download the test data or see more information on the task, please visit http://alt.qcri.org/semeval2017/task2/ . Data is available in English, German, Italian, Spanish and Farsi.
This task provides a challenging benchmark for the evaluation of semantic similarity techniques, and in particular for word embeddings. The task also provides ten datasets for evaluating cross-lingual semantic similarity techniques as well as bilingual and multilingual word representations. The evaluation datasets provide a well-balanced set of word pairs, including domain-specific words, multiword expressions and named entities.
ORGANIZERS
José Camacho Collados, Sapienza University of Rome, Italy
Mohammad Taher Pilehvar, Cambridge University, UK
Nigel Collier, Cambridge University, UK
Roberto Navigli, Sapienza University of Rome, Italy
--
José Camacho Collados
Linguistic Computing Laboratory (LCL)
Sapienza University of Rome
http://wwwusers.di.uniroma1.it/~collados/