SEPLN 2012 workshop on:
Sentiment Analysis at SEPLN (TASS 2012)
Sep 7, 2012 Castellón, Spain
The rise of social media such as blogs and social networks, together with the growing volume of user-generated content in the form of reviews, recommendations, ratings and other opinions, has created an emerging trend towards online reputation analysis. Sentiment analysis, i.e., the application of natural language processing and text analytics to identify and extract subjective information from texts, is the first step towards online reputation analysis. It is becoming a promising topic in marketing and customer relationship management, as social media and its associated word-of-mouth effect are turning out to be the most important source of information for companies about their customers' sentiments towards their brands and products.
Sentiment analysis is a major technological challenge. The task is so hard that even humans often disagree on the sentiment of a given text. What one individual finds acceptable or relevant may not be so for others; this, together with multilingual aspects, cultural factors and differing contexts, makes it very hard to classify a natural-language text as expressing positive or negative sentiment. And the shorter the text, as with Twitter messages or short Facebook comments, the harder the task becomes.
Within this context, TASS is an experimental evaluation workshop organized as a satellite event of the SEPLN 2012 Conference. It will be held on September 7th, 2012 at Jaume I University in Castellón de la Plana, Comunidad Valenciana, Spain, to foster research on sentiment analysis in social media, with a specific focus on the Spanish language. The main objective is to promote the application of existing state-of-the-art algorithms and techniques, and the design of new ones, for building complex systems able to perform sentiment analysis on short opinion texts extracted from social media messages (specifically Twitter) published by a series of representative personalities.
The challenge task is intended to provide a benchmark forum for comparing the latest approaches in this field. In addition, with the creation and release of the fully tagged corpus, we aim to provide a benchmark dataset that enables researchers to compare their algorithms and systems.
Two tasks are proposed for the participants: sentiment analysis and trending topic coverage.
*** Task 1: Sentiment Analysis ***
This task consists of performing automatic sentiment analysis to determine the polarity of each message in the test corpus.
*** Task 2: Trending topic coverage ***
In this case, the technological challenge is to build a classifier that identifies the topic of the text, and then to apply the polarity analysis to obtain an assessment for each topic.
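As a rough illustration of how the two tasks fit together, the following minimal Python sketch assigns a polarity and a topic to each message and then counts polarities per topic. Everything in it is an assumption made for illustration only: the tiny word lists, the topic names, the "positive" / "negative" / "neutral" labels and the function names are hypothetical and are not taken from the TASS corpus or its evaluation guidelines. Participating systems would be expected to learn such models from the training corpus rather than rely on hand-written keywords.

```python
from collections import defaultdict

# Hypothetical toy resources (assumptions, not part of the TASS corpus).
POLARITY_LEXICON = {"bueno": 1, "excelente": 1, "malo": -1, "terrible": -1}
TOPIC_KEYWORDS = {
    "politica": {"gobierno", "elecciones"},
    "futbol": {"gol", "liga"},
}


def polarity(text):
    """Task 1 (sketch): sum word scores and map the total to a label."""
    score = sum(POLARITY_LEXICON.get(tok, 0) for tok in text.lower().split())
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def topic(text):
    """Task 2 (sketch): pick the topic whose keywords overlap the text most."""
    tokens = set(text.lower().split())
    best, best_overlap = "other", 0
    for name, keywords in TOPIC_KEYWORDS.items():
        overlap = len(tokens & keywords)
        if overlap > best_overlap:
            best, best_overlap = name, overlap
    return best


def polarity_by_topic(messages):
    """Count how many messages of each polarity fall under each topic."""
    counts = defaultdict(lambda: defaultdict(int))
    for msg in messages:
        counts[topic(msg)][polarity(msg)] += 1
    return {t: dict(c) for t, c in counts.items()}
```

Calling polarity_by_topic on a list of message strings returns a dictionary mapping each detected topic to its polarity counts, which is one simple way to summarize the assessment per topic.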
Julio Villena-Román, Daedalus, Spain
Cristina Moreno-García, Daedalus, Spain
Janine García-Morera, Daedalus, Spain
Linda Ferrer-Ureña, Daedalus, Spain
Sara Lana-Serrano, Technical University of Madrid, Spain (GSI-UPM)
José Carlos González-Cristóbal, Technical University of Madrid, Spain (GSI-UPM)
Adam Westerski, Technical University of Madrid, Spain (GSI-UPM)
L. Alfonso Ureña-López, University of Jaén, Spain (SINAI-UJAEN)
Miguel Ángel García-Cumbreras, University of Jaén, Spain (SINAI-UJAEN)
María-Teresa Martín-Valdivia, University of Jaén, Spain (SINAI-UJAEN)
Eugenio Martínez-Cámara, University of Jaén, Spain (SINAI-UJAEN)

Alexandra Balahur, EC-Joint Research Centre, Italy
José Carlos Cortizo, European University of Madrid, Spain
Miguel Ángel García-Cumbreras, University of Jaén, Spain
Ana García-Serrano, UNED, Spain
José María Gómez-Hidalgo, Optenet, Spain
José Carlos González-Cristóbal, Technical University of Madrid, Spain
Julio Gonzalo-Arroyo, UNED, Spain
Carlos A. Iglesias-Fernández, Technical University of Madrid, Spain
Zornitsa Kozareva, Information Sciences Institute, USA
Sara Lana-Serrano, Technical University of Madrid, Spain
Bing Liu, University of Illinois at Chicago, USA
María Teresa Martín-Valdivia, University of Jaén, Spain
Paloma Martínez-Fernandez, Carlos III University of Madrid, Spain
Andrés Montoyo, University of Alicante, Spain
Rafael Muñoz, University of Alicante, Spain
Günter Neumann, DFKI, Germany
Paolo Rosso, Technical University of Valencia, Spain
Maite Taboada, Simon Fraser University, Canada
Mike Thelwall, University of Wolverhampton, U.K.
José Antonio Troyano, University of Seville, Spain
L. Alfonso Ureña-López, University of Jaén, Spain
Julio Villena-Román, Daedalus, Spain
May 21st, 2012: Release of training and validation corpora.
June 4th, 2012: Release of test corpus.
June 15th, 2012: Deadline for registration for the tasks.
July 9th, 2012: Experiment submissions by participants.
August 1st, 2012: Evaluation results.
August 12th, 2012: Submission of papers.
September 7th, 2012: Workshop.
tass at daedalux.es
Eugenio Martínez Cámara. Grupo de Investigación SINAI. Departamento de Informática. Universidad de Jaén. emcamara at ujaen dot es