[Corpora-List] FW: CFP: KONVENS 2014 Workshop: NLP 4 CMC: NLP for Computer-Mediated Communication / Social Media

Torsten Zesch zesch at ukp.informatik.tu-darmstadt.de
Thu Mar 13 22:41:50 CET 2014

===================================================================
Workshop: NLP 4 CMC: NLP for Computer-Mediated Communication / Social Media


Pre-conference workshop at KONVENS 2014, Hildesheim, Germany, October 6, 2014.


Submission Deadline: June 15, 2014

TOPIC AND SCOPE OF THE WORKSHOP:

Over the past decade, there has been growing interest in collecting, processing and analyzing data from genres of social media and computer-mediated communication (CMC). In large corpora that have been automatically crawled from the web, CMC data are often regarded as an unloved "bycatch" that is difficult to handle with NLP tools optimized for processing edited text. On the other hand, these data are important parts of web corpora for all research and application contexts that require data sets representing the diversity of genres and linguistic variation on the web. For corpus-based variational linguistics, CMC corpora are an important resource for closing the "CMC gap" both in corpora of contemporary written language and in corpora of spoken language: since CMC and social media make up an important part of everyday communication, investigations into language change and linguistic variation need to be able to include CMC and social media data in their empirical analyses.

Nevertheless, the development of approaches and tools for processing the linguistic and structural peculiarities of CMC genres and for building CMC corpora is lagging behind the interest in dealing with these types of data in the fields of language technology, corpus-based linguistics and web mining.

The goal of this workshop is to provide a platform for the presentation of results and ongoing work in adapting NLP tools for processing CMC / social media data. The focus of the workshop is on German data, but submissions on NLP approaches, annotation experiments, etc. for data in other European languages are also welcome as long as they make a significant contribution to the further development of the processing of CMC phenomena.

TOPICS OF INTEREST:

We encourage the submission of long and short research and demo papers including, but not restricted to, the following topics related to social media / computer-mediated communication:

* Corpora and lexical semantic resources for the analysis of social media / CMC
* Normalization (spelling correction, ...)
* Automatic preprocessing (tokenization, POS tagging, lemmatization, parsing, word sense disambiguation)
* Annotation of linguistic and structural features in social media / CMC data (annotation schemas, annotation experiments, ...)
* Domain adaptation
* Automatic methods in corpus-based CMC / social media analysis (sentiment, summarization, trend detection, ...)
* Big-data social media analysis

IMPORTANT DATES:

* Submissions due: 15 June 2014
* Notification: 15 July 2014
* Camera-ready papers due: 30 August 2014
* Workshop: 6 October 2014

SUBMISSIONS:

Submissions should include the names and addresses of all authors and meet the following requirements:

* Full Papers (8 pages)
* Short Papers (2-4 pages): position papers or work in progress
* Demonstrations (2-4 pages): presentation of systems or prototypes
* Submissions need to be made in English and should be in PDF format
* Submissions need to follow the KONVENS format (http://www.uni-hildesheim.de/konvens2014/pages/Submissions.html)

Submissions will be accepted via the Easychair system: https://www.easychair.org/conferences/?conf=nlp4cmc

ORGANIZERS:

* Michael Beißwenger (TU Dortmund University)
* Torsten Zesch (University of Duisburg-Essen)

The workshop is organized by the special interest group "Social Media / Computer-Mediated Communication" of the German Society for Computational Linguistics & Language Technology (GSCL) (http://gscl.org/ak-ibk.html).

PROGRAM COMMITTEE:

* Sabine Bartsch (TU Darmstadt)
* Thomas Bartz (TU Dortmund)
* Michael Beißwenger (TU Dortmund)
* Thierry Chanier (Clermont-Ferrand)
* Isabella Chiari (Università "La Sapienza", Rome)
* Stefanie Dipper (Ruhr-Universität Bochum)
* Stefan Evert (Universität Erlangen)
* Verena Henrich (Universität Tübingen)
* Lothar Lemnitzer (BBAW, Berlin)
* Anke Lüdeling (Humboldt-Universität Berlin)
* Harald Lüngen (IDS, Mannheim)
* Preslav Nakov (Qatar QCRI)
* Günter Neumann (DFKI, Saarbrücken)
* Melanie Neunerdt (RWTH Aachen)
* Ines Rehbein (Universität Potsdam)
* Egon W. Stemle (EURAC, Bozen)
* Angelika Storrer (Universität Mannheim)
* Kay-Michael Würzner (Universität Potsdam)
* Torsten Zesch (Universität Duisburg-Essen)

WORKSHOP WEBSITE: https://sites.google.com/site/nlp4cmc/
