[Corpora-List] Corpora Digest, Vol 73, Issue 34

Eman Alnafjan emanfahad at gmail.com
Thu Aug 1 23:43:11 CEST 2013


On Tuesday, July 30, 2013, wrote:

> Today's Topics:
> 1. Two open positions in computational linguistics, IMS
> Stuttgart (Sebastian Pado)
> 2. DEADLINE EXTENSION: Web Data as a Challenge for Theoretical
> Linguistics and Corpus Design (Marburg, March 5-7, 2014)
> (Roland Schäfer)
> 3. First release of an English lexicon for DBpedia (christina unger)
> 4. DEADLINE EXTENSION: IEEE Computational Intelligence Magazine
> (Impact Factor: 4.629) (Erik Cambria)
> 5. WorldCIST'14 - World Conference on IST, 15 - 18 April 2014,
> at Madeira Island (maria Lemos)
> 6. Fwd: Call for Participation: ACL/BSNLP -- The Fourth
> Workshop on Balto-Slavic Natural Language Processing (Roman
> Yangarber)
> ----------------------------------------------------------------------
> Message: 1
> Date: Mon, 29 Jul 2013 12:20:58 +0200
> From: Sebastian Pado <pado at ims.uni-stuttgart.de>
> Subject: [Corpora-List] Two open positions in computational
> linguistics, IMS Stuttgart
> To: corpora at uib.no
> One PhD and one postdoctoral position in Computational Linguistics
> -----------------------------------------------------------------
> The research group of Theoretical Computational Linguistics at IMS,
> Stuttgart University, is looking to fill two positions: one PhD
> student position and one postdoctoral researcher position. Payment is
> according to the German TV-L pay scale (TV-L 13 50% and 100%,
> respectively; see [1] for details). Both positions are initially open
> for three years.
> The research group, headed by Sebastian Pado, is concerned primarily
> with the cognitive and linguistic aspects of word and phrase meaning,
> machine learning methods for meaning acquisition, applications of
> lexical semantics as well as cross-lingual semantic analysis [2].
> * The PhD position is in the area of semantics of
> discourse. Prerequisite is a master's degree or diploma in
> computational linguistics, computer science, or a related field.
> * The postdoctoral researcher is expected to have a clear research
> profile, contribute to the research agenda, and conduct independent
> research. Prerequisite is a PhD degree in computational linguistics,
> computer science, or a related field.
> Candidates for both positions are expected to have substantial
> previous knowledge of natural language processing, solid programming
> skills. Prior experience in the specific areas outlined above is a
> plus. Since both positions come with teaching obligations, candidates
> need to have a good command of German, or be willing to acquire it
> quickly.
> Please send applications (a single PDF file containing a motivation
> letter, CV, list of publications, research statement, and names of two
> references) by email to Sebastian Pado, pado at ims.uni-stuttgart.de.
> Applications received by August 31st will receive full consideration.
> The positions are open until filled.
> For further information, send me an email to pado at ims.uni-stuttgart.de
> or meet me directly at ACL 2013.
> Stuttgart University is an equal opportunity employer. Applications of
> women are strongly encouraged. Severely challenged persons will be given
> preference in case of equal qualifications.
> [1] http://oeffentlicher-dienst.info/c/t/rechner/tv-l/west?id=tv-l-2013
> [2] http://www.nlpado.de/~sebastian
> ------------------------------
> Message: 2
> Date: Mon, 29 Jul 2013 13:30:33 +0200
> From: Roland Schäfer <roland.schaefer at fu-berlin.de>
> Subject: [Corpora-List] DEADLINE EXTENSION: Web Data as a Challenge
> for Theoretical Linguistics and Corpus Design (Marburg, March 5-7,
> 2014)
> To: list corpora <corpora at uib.no>
> *DEADLINE EXTENSION: August 12, 2013*
> Web Data as a Challenge for Theoretical Linguistics and Corpus Design
> Workshop at the 36th Annual Conference of the German Linguistic Society
> (March 5-7, 2014 at Marburg University, Marburg/Lahn, Germany)
> Website: http://hpsg.fu-berlin.de/cow/dgfs2014/
> EasyChair: https://www.easychair.org/conferences/?conf=webtl2014
> Organizers:
> Felix Bildhauer (Freie Universität Berlin/SFB632)
> Roland Schäfer (Freie Universität Berlin)
> Program Committee:
> Chris Biemann
> Stefan Evert
> Matthias Hüning
> Anke Lüdeling
> Alexander Mehler
> Uwe Quasthoff
> Amir Zeldes
> Torsten Zesch
> Arne Zeschel
> Important Dates:
> First call for papers: Monday, June 17, 2013
> Second call for papers: Friday, July 19, 2013
> EXTENDED Submission Deadline: Monday, August 12, 2013 @ 23:59 GMT
> Workshop: March 5-7, 2014
> *Aim of the Workshop and Call for Papers*
> The huge amounts of linguistic data on the web offer exciting new
> possibilities in empirically based theoretical linguistics. Web-derived
> linguistic resources can contain greater amounts of variation as well as
> non-standard grammar and writing compared to traditionally compiled
> corpora. Also, whole new registers and genres have been described to
> emerge on the web. Like spoken language - although clearly distinct from
> it - the language found on the web can thus challenge linguistic
> theories which are based mainly on standard written language as well as
> the categories assumed within these theories. At the same time, such
> non-standard features make the data harder to process for computational
> linguists, and additional care is required in making the decision of
> labeling material as "noise", because it might be considered valuable
> data by some linguists.
> This workshop aims to bring together researchers working in Theoretical
> Linguistics and Corpus Linguistics with those who create resources from
> web data. The primary question of the workshop is: Which new linguistic
> insights can we derive from web data? Secondarily, we ask how web data
> is (and how it should be) processed to produce easily accessible
> high-quality resources and thus facilitate this kind of innovative
> linguistic research.
> Possible subjects for talks include (but are by no means restricted to):
> - theoretically motivated empirical studies of linguistic phenomena in
> web data,
> - work on problems with established linguistic categories specific to
> certain types of web data (problems with traditional part-of-speech
> classification, syntactic categories, register and genre
> classification, etc.),
> - problems of working with web corpora from the user's perspective in
> concrete studies (low quality of: tokenization, POS tagging, named
> entity recognition, etc.; availability and lack of meta data),
> - assessments and improvements of the quality of available and newly
> designed tools and models to process or classify web data,
> - approaches to normalization of web data and evaluations of the
> acceptability of such normalizations from a linguistic perspective,
> - sampling of web data (e.g., stratified vs. randomly compiled corpora,
> linguistic web characterization)
> We invite submissions for 30 minute talks (20 minutes plus 10 minutes of
> discussion) about completed or ongoing original research in which web
> data is used or which is about the creation and/or evaluation of web
> data resources. The scope of the workshop is neither restricted to
> resources of a specific size or nature nor to any specific language(s).
> Submitted abstracts will be reviewed anonymously by at least two
> reviewers. We hope to offer authors of accepted talks the opportunity to
> publish an extended version of their talk in a special issue of a
> peer-reviewed corpus linguistics journal.
> *Submission Details*
> - Submitted abstracts for 30 minute presentations (20 minutes plus 10
> minutes discussion) should be between 800 and 1,000 words long
> (excluding references and tables).
> - Submissions must be anonymous. Please take care in removing
> information from the file which could reveal your identity.
> - The language of all abstracts and the workshop is English.
> - The only accepted file format for submission is PDF.
> - Submission must be made via EasyChair (WEBTL-2014):
> https://www.easychair.org/conferences/?conf=webtl2014
> - Authors of accepted papers will be asked to provide a shorter 200 word
> abstract to be printed in the conference program as an MS Word or
> OpenDocument file.
> ------------------------------
> Message: 3
> Date: Mon, 29 Jul 2013 17:39:37 +0200
> From: christina unger <cunger at cit-ec.uni-bielefeld.de>
> Subject: [Corpora-List] First release of an English lexicon for
> DBpedia
> To: corpora at uib.no
> Dear all,
> The Semantic Computing group at Bielefeld University [1] is happy to
> announce the first release of an English lexicon for DBpedia in lemon
> [2] format:
> http://lemon-model.net/lexica/dbpedia_en
> It comprises 1,217 verbalizations covering 353 classes as well as 300
> properties (all those that have more than 10,000 occurrences in the
> DBpedia dataset). It can prove useful for a wide range of ontology-based
> NLP applications, such as question answering and natural language
> generation from RDF or SPARQL.
> We invite you to browse and use the lexicon, and would love to get
> feedback.
> Also, everyone is welcome to improve the lexicon, extend it, and port it
> to other languages (we are currently working on an initial version of a
> Spanish and a German lexicon). The source is available on GitHub:
> https://github.com/cunger/lemon.dbpedia
> If you are interested in contributing, we'd be happy to hear from you!
> With best regards,
> Christina
> [1] http://www.sc.cit-ec.uni-bielefeld.de
> [2] http://lemon-model.net
> --
> Christina Unger, PhD
> Semantic Computing Group
> CITEC, Bielefeld University
> Office: H1-124
> Phone: 0521 106 12224
> Homepage: http://www.sc.cit-ec.uni-bielefeld.de/people/cunger
> ------------------------------
> Message: 4
> Date: Tue, 30 Jul 2013 01:21:58 +0800
> From: Erik Cambria <cambria at nus.edu.sg>
> Subject: [Corpora-List] DEADLINE EXTENSION: IEEE Computational
> Intelligence Magazine (Impact Factor: 4.629)
> To: Corpora-List <corpora at uib.no>
> Apologies for cross-posting.
> Due to the many extension requests, the submission deadline of the IEEE
> CIM special issue on Computational Intelligence for Natural Language
> Processing has been postponed to the end of this week. For more/up-to-date
> info, please visit http://sentic.net/cinlp
> The textual information available on the Web can be broadly grouped into
> two main categories: facts and opinions. Facts are objective expressions
> about entities or events. Opinions are usually subjective expressions that
> describe people's sentiments, appraisals, or feelings towards such entities
> and events. Much of the existing research on textual information processing
> has been focused on mining and retrieval of factual information, e.g., text
> classification, text recognition, text clustering, and many other text
> mining and natural language processing (NLP) tasks. Little work had been
> done on the processing of opinions until only recently.
> One of the main reasons for the lack of studies on opinions is the fact
> that there was little opinionated text available before the recent passage
> from a read-only to a read-write Web. Before that, in fact, when people
> needed to make a decision, they typically asked for opinions from friends
> and family. Similarly, when organizations wanted to find the opinions or
> sentiments of the general public about their products and services, they
> had to specifically ask people by conducting opinion polls and surveys.
> However, with the advent of the Social Web, the way people express their
> views and opinions has dramatically changed. They can now post reviews of
> products at merchant sites and express their views on almost anything in
> Internet forums, discussion groups, and blogs. Such online word-of-mouth
> behavior represents new and measurable sources of information with many
> practical applications. Nonetheless, finding opinion sources and monitoring
> them can be a formidable task because there are a large number of diverse
> sources and each source may also have a huge volume of opinionated text.
> In many cases, in fact, opinions are hidden in long forum posts and blogs.
> It is extremely time-consuming for a human reader to find relevant sources,
> extract related sentences with opinions, read them, summarize them, and
> organize them into usable forms. Thus, automated opinion discovery and
> summarization systems are needed. Sentiment analysis grows out of this
> need: it is a very challenging NLP or text mining problem. Due to its
> tremendous value for practical applications, there has been an explosive
> growth of both research in academia and applications in the industry.
> All the sentiment analysis tasks, however, are very challenging. Our
> understanding and knowledge of the problem and its solution are still
> limited. The main reason is that it is a NLP task, and NLP has no easy
> problems. Another reason may be due to our popular ways of doing research.
> So far, in fact, researchers have relied a lot on traditional machine
> learning algorithms. Some of the most effective machine learning
> algorithms, however, produce no human understandable results. Apart from
> some superficial knowledge gained in the manual feature engineering
> process, in fact, such algorithms may achieve improved accuracy, but little
> about how and why is actually known. All such approaches, moreover, rely on
> syntactic structure of text, which is far from the way human mind processes
> natural language.
> Articles are thus invited in area of computational intelligence for
> natural language processing and understanding. The broader context of the
> Special Issue comprehends artificial intelligence, knowledge representation
> and reasoning, data mining, artificial neural networks, evolutionary
> computation, and fuzzy logic. Topics include, but are not limited to:
> - Computational intelligence for big social data analysis
> - Biologically inspired opinion mining
> - Concept-level opinion and sentiment analysis
> - Computational intelligence for social media retrieval and analysis
> - Computational intelligence for social media marketing
> - Social network modeling, simulation, and visualization
> - Semantic multi-dimensional scaling for sentiment analysis
> - Computational intelligence for patient opinion mining
> - Sentic computing
> - Multilingual and multimodal sentiment analysis
> - Multimodal fusion for continuous interpretation of semantics
> - Computational intelligence for time-evolving sentiment tracking
> - Computational intelligence for cognitive agent-based computing
> - Human-agent, -computer, and -robot interaction
> - Domain adaptation for sentiment classification
> - Affective common-sense reasoning
> - Computational intelligence for user profiling and personalization
> - Computational intelligence for knowledge acquisition
> August 4th, 2013: Paper submission deadline
> September 1st, 2013: Notification of acceptance
> October 1st, 2013: Final manuscript due
> February, 2014: Publication
> The maximum length for the manuscript is typically 25 pages in single
> column with double-spacing, including figures and references. Authors of
> papers should specify in the first page of their manuscripts corresponding
> author?s contact and up to 5 keywords. Submission should be made via email
> to one of the guest editors below.
> - Erik Cambria, National University of Singapore (Singapore)
> - Bebo White, Stanford University (USA)
> - Tariq S. Durrani, Royal Society of Edinburgh (UK)
> - Newton Howard, MIT Media Laboratory (USA)
> ------------------------------
> Message: 5
> Date: Mon, 29 Jul 2013 23:09:50 +0100
> From: "maria Lemos" <marialemos72 at gmail.com>
> Subject: [Corpora-List] WorldCIST'14 - World Conference on IST, 15 -
> 18 April 2014, at Madeira Island
> To: corpora at uib.no
> Apologies if you are receiving this mail more than once...
> Please disseminate by colleagues, researchers, students, etc. Thanks a lot!
> **********************************************************************************
> WorldCIST'14
> The 2014 World Conference on Information Systems and Technologies
> April 15 - 18, Madeira Island, Portugal
-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 18508 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20130802/6cd4c169/attachment.txt>

More information about the Corpora mailing list