We once annotated terms in a subset of the GNOME and KDE parallel documents in the IT domain, for English, German and Italian. Take a look and let me know if it is helpful.
On 25/04/17 18:25, Andraz Repar wrote:
> this is my first post to the Corpora list, so please be gentle :)
> I am looking for publicly available gold standard corpora for
> terminology extraction. Ideally, this would be a corpus where all
> terms have been annotated.
> I haven't been able to find any myself, and I realize this is probably
> a long shot. I would prefer European languages, but at this point I am
> not too picky and would take anything.
> Best regards,
> Andraž Repar
> International Postgraduate School Jožef Stefan, Ljubljana, Slovenia
--
Dr. Mihael Arcan
Postdoctoral Researcher at Unit for Natural Language Processing (UNLP)
Insight Centre for Data Analytics @ NUI Galway
http://nuig.insight-centre.org/unlp/people/members/mihael-arcan/