we are undertaking an exercise in identifying all the 'good' collocations for a set of headwords, so we have a gold standard dataset for evaluating a range of algorithms and resources. Our main reference point is Oxford Collocations Dictionary.
Is there anyone else who has done a similar task and has annotator guidelines for what to include? (Tough cases include titles, quotations, lines from songs, and many items half way to being names)
-- ======================================== Adam Kilgarriff <http://www.kilgarriff.co.uk/> adam at lexmasterclass.com Director Lexical Computing Ltd<http://www.sketchengine.co.uk/>
Visiting Research Fellow University of Leeds<http://leeds.ac.uk>
*Corpora for all* with the Sketch Engine <http://www.sketchengine.co.uk>
*DANTE: a lexical database for English<http://www.webdante.com>
* ======================================== -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 1582 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20121127/014b7eef/attachment.txt>