The 2009 CoNLL Shared Task developed syntactic dependency annotations, including the semantic dependency model roles of both verbal and nominal predicates. The following English data was used in the shared task:
* Treebank-2 (LDC95T7)<https://catalog.ldc.upenn.edu/LDC95T7>: over one million words of annotated English newswire and other text developed by the University of Pennsylvania
* Proposition Bank I (LDC2004T14)<https://catalog.ldc.upenn.edu/LDC2012T04>: semantic annotation of newswire text from Treebank-2 developed by the University of Pennsylvania
* NomBank v 1.0 (LDC2008T23)<https://catalog.ldc.upenn.edu/LDC2008T23>: argument structure for instances of common nouns in Treebank-2 and Treebank-3 (LDC99T42)<https://catalog.ldc.upenn.edu/LDC99T42>, developed by New York University For X-SRL, the English source data was automatically translated using DeepL<https://www.deepl.com/en/translator>. Automatic tokenization, lemmatization, part-of-speech tagging, and syntactic parsing were then applied to the text. The data was divided into train, development, and test partitions. Semantic labels were transferred for the train and development sections, and the test sentences were validated for translation quality, alignment, label transfer, and filtering.
X-SRL: Parallel Cross-lingual Semantic Role Labeling is distributed via web download.
2021 Subscription Members will automatically receive copies of this corpus. 2021 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for a fee. * (2) TAC KBP English Sentiment Slot Filling - Comprehensive Training and Evaluation Data 2013-2014<https://catalog.ldc.upenn.edu/LDC2021T08> was developed by LDC and contains training and evaluation data produced in support of the 2013<https://tac.nist.gov/2013/KBP/SentimentSF/index.html> and 2014<https://tac.nist.gov/2014/KBP/Sentiment/index.html> TAC KBP Sentiment Slot Filling tracks. The data in this release includes queries, manual runs (human-produced query responses), and assessment results for human- and system-produced query responses. Source data was English news and web text.
The regular English Slot Filling track involved mining information about entities from text using a specified set of "slots", or attributes. The goal of the Sentiment Slot Filling task was to evaluate the quality of detectors for positive and negative sentiment.
TAC KBP English Sentiment Slot filling - Comprehensive Training and Evaluation Data 2013-2014 is distributed via web download.
2021 Subscription Members will automatically receive copies of this corpus. 2021 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for a fee.
Membership Coordinator Linguistic Data Consortium<ldc.upenn.edu> University of Pennsylvania T: +1-215-573-1275 E: ldc at ldc.upenn.edu<mailto:ldc at ldc.upenn.edu> M: 3600 Market St. Suite 810
Philadelphia, PA 19104
-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 10700 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20210415/98d30d3b/attachment.txt>