[Corpora-List] Gold standard for document similarity

Ivelina Nikolova iva at lml.bas.bg
Tue Mar 4 16:48:08 CET 2014


Dear corpora members,

I am looking for a gold standard to train/evaluate document similarity metrics. Can anyone suggest a suitable corpus for such purposes. I'm especially interested in similarity between newspaper articles.

Thanks in advance, Ivelina

-- Ivelina Nikolova PhD student in Computer Science Linguistic Modelling Department Institute of Information and Communication Technologies Bulgarian Academy of Sciences



More information about the Corpora mailing list