We have developed a new dataset, together with an easy-to-use Python scorer for word embeddings, based on the outlier detection task. Given a group of words, the goal of the task is to identify the word that does not belong in the group. For example, book is the outlier in the set {apple, banana, lemon, book, orange}, as it is the only word that is not a fruit. This task is well suited to testing properties of word vectors that common intrinsic evaluation benchmarks, such as word similarity, have not fully addressed to date. Although the task is well defined and humans achieve near-perfect performance, it remains challenging for state-of-the-art word embeddings.
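
As an illustration of the task only, the following minimal Python sketch picks the word whose vector has the lowest average cosine similarity to the other words in the group. The toy three-dimensional vectors and the function names cosine and find_outlier are assumptions made for this example. This is not the scorer distributed with the dataset, whose scoring is described in the reference paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def find_outlier(words, vectors):
    """Return the word with the lowest average similarity to the rest.

    Hypothetical helper for illustration; not the official scorer.
    """
    def avg_similarity(word):
        others = [w for w in words if w != word]
        total = sum(cosine(vectors[word], vectors[o]) for o in others)
        return total / len(others)

    return min(words, key=avg_similarity)


# Toy vectors invented for this example (not real word embeddings).
toy_vectors = {
    "apple": [0.9, 0.1, 0.0],
    "banana": [0.8, 0.2, 0.1],
    "lemon": [0.85, 0.15, 0.05],
    "orange": [0.9, 0.2, 0.0],
    "book": [0.1, 0.1, 0.9],
}

words = ["apple", "banana", "lemon", "book", "orange"]
print(find_outlier(words, toy_vectors))
```

With real embeddings, toy_vectors would be replaced by a lookup into the trained model, and words missing from the vocabulary would need separate handling.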

Please find more information about the dataset and the outlier detection task in the reference paper. The dataset and the Python script to test your word embeddings are freely available at http://lcl.uniroma1.it/outlier-detection/


José Camacho-Collados and Roberto Navigli. Find the word that does not belong: A Framework for an Intrinsic Evaluation of Word Vector Representations. In Proceedings of the ACL Workshop on Evaluating Vector Space Representations for NLP, Berlin, Germany, August 12, 2016.


