[Corpora-List] CfP: SemEval-2017 Shared Task on Detection and Interpretation of English Puns

Tristan Miller miller at ukp.informatik.tu-darmstadt.de
Fri Jul 1 16:41:29 CEST 2016

==== Call for Participation ====

SemEval-2017 Shared Task: Detection and Interpretation of English Puns


Researchers and industry professionals are invited to participate in a shared task on the computational detection and interpretation of English puns. The task will occur as part of the SemEval-2017 workshop, to be held in conjunction with a major NLP conference (TBA) in the summer of 2017. SemEval is an ongoing series of evaluations of computational semantic analysis systems, organized under the aegis of SIGLEX, the Special Interest Group on the Lexicon of the Association for Computational Linguistics.

---- Task description ----

A pun is a form of wordplay in which one signifier (e.g., a word or phrase) suggests two or more meanings by exploiting polysemy, or phonological similarity to another signifier, for an intended humorous or rhetorical effect. Puns where the two meanings share the same spelling are known as homographic, whereas those where the two meanings are spelled (and also usually pronounced) differently are known as heterographic.

Conscious or tacit linguistic knowledge -- particularly of lexical semantics and phonology -- is an essential prerequisite for the production and interpretation of puns. This has long made them an attractive subject of study in theoretical linguistics, and has led to a small but growing body of research into puns in computational linguistics. This SemEval shared task will be the first organized evaluation of automatic pun processing systems.

Participants will be provided with two data sets. The first data set will contain several thousand short contexts (jokes, slogans, aphorisms, etc.). In some of these contexts, a single word will be used as a homographic pun; in the rest, there will be no pun. The second data set will be similar to the first, except that the puns will be heterographic rather than homographic. For one or both data sets, participating systems will compete in any or all of three subtasks:

Subtask 1: Pun detection. For this subtask, participants are given an entire raw data set. For each context, the system must decide whether or not it contains a pun.

Subtask 2: Pun location. For this subtask, the contexts not containing puns are removed from the data set. For each context, the system must identify which word is the pun.

Subtask 3: Pun interpretation. For this subtask, the pun word in each context is marked, and contexts where the pun's two meanings are not found in WordNet are removed from the data set. For each context, the system must annotate the two meanings of the given pun by reference to WordNet sense keys.

For the first two subtasks, system performance will be measured with the usual precision and recall metrics from information retrieval, and for the third subtask, we will use slightly modified versions of the precision and recall metrics used for WSD.

---- Practical information ----

The following schedule is adapted from the SemEval-2017 call for task proposals and is subject to change.

* July 1, 2016: Trial data ready * January 10, 2017: Evaluation start * January 31, 2017: Evaluation end * February 28, 2017: Paper submission due * March 31, 2017: Paper reviews due * April 30, 2017: Camera-ready submission due * Summer 2017: SemEval-2017 workshop

---- Organizing committee ----

* Tristan Miller, UKP Lab, Technische Universitšt Darmstadt * Christian F. Hempelmann, Ontological Semantic Technology Lab, Texas A&M University-Commerce * Iryna Gurevych, UKP Lab, Technische Universitšt Darmstadt

To contact the organizing committee, please e-mail Tristan Miller.

-- Tristan Miller, Research Scientist Ubiquitous Knowledge Processing Lab (UKP-TUDA) Department of Computer Science, Technische Universitšt Darmstadt Tel: +49 6151 162 5296 | Web: https://www.ukp.tu-darmstadt.de/

-------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 819 bytes Desc: OpenPGP digital signature URL: <https://mailman.uib.no/public/corpora/attachments/20160701/4ce0ef57/attachment.asc>

More information about the Corpora mailing list