[Corpora-List] (New book) Farzindar & Inkpen: NLP for Social Media

Graeme Hirst gh at cs.toronto.edu
Mon Sep 14 20:35:19 CEST 2015


Natural Language Processing for Social Media

by Atefeh Farzindar (NLP Technologies Inc. and Université de Montréal) and Diana Inkpen (University of Ottawa)

Synthesis Lectures on Human Language Technologies #30 (Morgan & Claypool Publishers), 2015, 168 pages


In recent years, online social networking has revolutionized interpersonal communication. The newer research on language analysis in social media has been increasingly focusing on the latter’s impact on our daily lives, both on a personal and a professional level. Natural language processing (NLP) is one of the most promising avenues for social media data processing. It is a scientific challenge to develop powerful methods and algorithms which extract relevant information from a large volume of data coming from multiple sources and languages in various formats or in free form. We discuss the challenges in analyzing social media texts in contrast with traditional documents.

Research methods in information extraction, automatic categorization and clustering, automatic summarization and indexing, and statistical machine translation need to be adapted to a new kind of data. This book reviews the current research on Natural Language Processing (NLP) tools and methods for processing the non-traditional information from social media data that is available in large amounts (big data), and shows how innovative NLP approaches can integrate appropriate linguistic information in various fields such as social media monitoring, health care, business intelligence, industry, marketing, and security and defense.

We review the existing evaluation metrics for NLP and social media applications, and the new efforts in evaluation campaigns or shared tasks on new datasets collected from social media. Such tasks are organized by the Association for Computational Linguistics (such as SemEval tasks) or by the National Institute of Standards and Technology via the Text REtrieval Conference (TREC) and the Text Analysis Conference (TAC). In the concluding chapter, we discuss the importance of this dynamic discipline and its great potential for NLP in the coming decade, in the context of changes in mobile technology, cloud computing, and social networking.

http://www.morganclaypool.com/doi/abs/10.2200/S00659ED1V01Y201508HLT030 <http://www.morganclaypool.com/doi/abs/10.2200/S00659ED1V01Y201508HLT030>

This title is available online without charge to members of institutions that have licensed the Synthesis Digital Library of Engineering and Computer Science. Members of licensing institutions have unlimited access to download, save, and print the PDF without restriction; use of the book as a course text is encouraged. To find out whether your institution is a subscriber, visit http://www.morganclaypool.com/page/licensed <http://www.morganclaypool.com/page/licensed>, or just click on the book's URL above from an institutional IP address and attempt to download the PDF. Others may purchase the book from this URL as a PDF download for US$30 or in print for US$40. Printed copies are also available from Amazon and from booksellers worldwide at approximately US$45 or local currency equivalent.

:::: Graeme Hirst • Series editor, Synthesis Lectures in Human Language Technologies
:::: University of Toronto • Department of Computer Science
-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 5245 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20150914/684da20c/attachment.txt>

More information about the Corpora mailing list