Please find more information about the dataset and the outlier detection task in the reference paper. The dataset and the Python script to test your word embeddings are freely available at http://lcl.uniroma1.it/outlier-detection/
Reference:
José Camacho-Collados and Roberto Navigli. Find the word that does not belong: A Framework for an Intrinsic Evaluation of Word Vector Representations. In Proceedings of the ACL Workshop on Evaluating Vector Space Representations for NLP, Berlin, Germany, August 12, 2016.
http://lcl.uniroma1.it/outlier-detection/ACL16_REPEVAL_Outlier_Detection.pdf
Best regards,
José Camacho Collados and Roberto Navigli Linguistic Computing Laboratory, Sapienza University of Rome
-- José Camacho Collados Linguistic Computing Laboratory (LCL) Sapienza University of Rome http://wwwusers.di.uniroma1.it/~collados/ -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 6140 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20160627/05523baa/attachment.txt>