[Corpora-List] {Spam?} Re: {Spam?} Corpora Digest, Vol 115, Issue 23

Ludivine Crible ludivine.crible at uclouvain.be
Tue Jan 17 09:03:04 CET 2017

Call for Papers: Discourse markers in Romance Languages

[Apologies for cross-posting]

5th International Conference
Discourse Markers in Romance Languages: Boundaries and Interfaces
8 – 10 November 2017
Université catholique de Louvain (Belgium)
https://disrom2017.wordpress.com/

The Linguistic Research Unit of the Université catholique de Louvain (UCL) invites researchers on discourse markers in Romance languages to submit contributions for the 5th International Conference on Discourse Markers in Romance Languages: Boundaries and Interfaces, which will be held from November 8 to November 10, 2017 in Louvain-la-Neuve (Belgium).

The Conference is one of a series of conferences on discourse markers in Romance languages (Madrid, 2010; Buenos Aires, 2011; Campinas, 2012; Heidelberg, 2015) and aims to build on the previous events. The conference serves as a platform for internationally renowned linguists and young researchers alike to exchange views and ideas and to broaden their research perspectives. This year’s theme will specifically address issues of categorization and the boundaries of the class of discourse markers. In this respect, discourse marker studies invite further investigation of the interfaces between several levels of linguistic analysis (Prosody, Syntax, Semantics, Pragmatics, Discourse).

With special reference to discourse marker research, the conference will be an opportunity to look at the findings provided so far by theoretical and applied linguistics and related disciplines. The following plenary speakers have already confirmed their attendance:

Dr. Catherine Bolly, Universität zu Köln: Multimodal pragmatic constructions at the speech-gesture interface applied to French
Prof. Maria Josep Cuenca, Universitat de València: Contrastive markers in contrast in Spanish and Catalan
Dr. Amália Mendes, Universidade de Lisboa: Lexicographic applications of discourse markers in Portuguese
Prof. Jacqueline Visconti, Università degli Studi di Genova: On the diachrony of discourse markers, with a focus on Italian


We welcome abstracts for papers and posters on any theoretical, empirical or applied topic related to discourse markers in Romance languages, including (but not restricted to):

- definition of the discourse marker category;
- lexicons of discourse markers;
- discourse markers and their relation to other pragmatic categories;
- syntax-prosody-discourse interface;
- sociolinguistic approaches to discourse markers;
- variation of discourse markers across registers, languages and language varieties;
- translation studies;
- L1 and L2 acquisition of discourse markers;
- diachronic studies;
- experimental studies;
- corpus-based and computational studies;
- applied studies (business language, legal discourse, educational settings, etc.).

Conference presentation modalities

1. Oral presentations
- Presentations should be no longer than 20 minutes. Presentations will be followed by a 10-minute discussion.

2. Posters
- Posters should not be larger than A0 size (118.9 x 84.1 cm, landscape or portrait)
- Posters may be printed in colour or in black and white
- Authors must bring the printed posters to the conference
- Contributors are requested to be present during the poster sessions
- The poster session(s) will be preceded by a short one-minute poster boost session where authors will have the opportunity to present the topic of their research.

Abstracts and conference languages

Presentations may be held in English or any Romance language. Plenary conferences will be held in English.

Submission of abstracts

Abstracts should not exceed one page (single spacing, 12-point Times New Roman font), not including figures and references, and must be uploaded as PDF. Abstracts can be written in English or any Romance language.

Please submit your abstracts to https://easychair.org/conferences/?conf=disrom2017

For the submission of abstracts for posters, you can either select the poster option on the online registration site or apply for both an oral presentation and a poster contribution. In the latter case, the scientific committee reserves the right to decide which option will be more appropriate for the proposal submitted.

Important dates

15 April 2017: Deadline for abstract submission
15 June 2017: Notification of acceptance
8 – 10 November 2017: Conference dates

Registration will be possible from July 1, 2017

Conference fees

                      Early-bird fee              Regular fee
                      (by 15 September 2017)
Doctoral students     80 €                        100 €
Other participants    120 €                       140 €

Conference dinner: 50 €

Scientific Committee (to be completed)

• Esperanza Acín Villa (Universidade de A Coruña)
• Joanna Blochowiak (Université de Genève)
• Margarita Borreguero Zuloaga (Universidad Complutense de Madrid)
• Chloé Braud (University of Copenhague)
• Ludivine Crible (Université catholique de Louvain)
• Iria da Cunha (Universidad Nacional de Educación a Distancia)
• Laurence Danlos (Université Paris 7 Diderot)
• Anna-Maria De Cesare (Universität Basel)
• Gaétane Dostie (Université de Sherbrooke)
• Benjamin Fagard (CNRS, ENS & Université Paris 3 Sorbonne Nouvelle)
• Mar Garachana Camarero (Universitat de Barcelona)
• Gaëtanelle Gilquin (Université catholique de Louvain)
• Sonia Gómez-Jordana (Universidad Complutense de Madrid)
• Antonio Hidalgo Navarro (Universitat de València)
• Lydia-Mai Ho-Dac (Université de Toulouse)
• Martin Hummel (Universität Graz)
• Julia Lavid Lopez (Universidad Complutense de Madrid)
• Diana Lewis (Université Aix-Marseille)
• Araceli López Serena (Universidad de Sevilla)
• Jacques Moeschler (Université de Genève)
• Silvia Murillo Ornat (Universidad de Zaragoza)
• Salvador Pons Bordería (Universitat de València)
• Sorina Postolea (University of Iasi)
• Laurent Prévot (Université Aix-Marseille)
• Amalia Rodrígues Somolinos (Universidad Complutense de Madrid)
• Sandrine Zufferey (Universität Bern)

Organising Committee (in alphabetical order)

Coordination: Liesbeth Degand, Université catholique de Louvain

Research Group Valibel – Discourse & Variation, Université catholique de Louvain [www.uclouvain.be/valibel]

Ludivine Crible, Université catholique de Louvain
Barbara de Cock, Université catholique de Louvain
Maïté Dupont, Université catholique de Louvain
Julien Perrez, Université de Liège
Inés Recio Fernández, Universität Heidelberg
Anne-Catherine Simon, Université catholique de Louvain

________________________________________
From: corpora-bounces at uib.no <corpora-bounces at uib.no> on behalf of corpora-request at uib.no <corpora-request at uib.no>
Sent: 17 January 2017 01:04
To: corpora at uib.no
Subject: {Spam?} Corpora Digest, Vol 115, Issue 23

Today's Topics:

1. Call for Papers: NLP for Computer-Assisted Language Learning and Language Acquisition (Elena Volodina)
2. IULA adopts ISLRN (ELRA ELDA Information)
3. IWCS 2017 First Call for Papers (Eva Maria Vecchi)
4. Word Sense Disambiguation: a Unified Evaluation Framework and Empirical Comparison (Alessandro Raganato)
5. Call for Papers: First Workshop on Abusive Language Online (Zeerak Waseem)
6. Deadline Extended: Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial) at EACL 2017 (Marcos Zampieri)
7. 2nd Call for papers: NooJ 2017 International Conference (NooJ 2017)


Message: 1
Date: Mon, 16 Jan 2017 12:31:37 +0100
From: Elena Volodina <elena.volodina at svenska.gu.se>
Subject: [Corpora-List] Call for Papers: NLP for Computer-Assisted Language Learning and Language Acquisition
To: corpora at uib.no

[with apologies for cross-posting]

Joint 6th NLP4CALL and 2nd NLP4LA, Nodalida 2017

NODALIDA <http://nodalida2017.se/> workshop, Gothenburg, Sweden,

22 May, 2017

*Description of the workshop*

Workshop website: https://spraakbanken.gu.se/eng/icall/joint6thNLP4CALL-2ndNLP4LA2017

For the second year in a row we are bringing two related themes of NLP for CALL and NLP for LA together. The goal of the joint workshop is to provide a meeting place for researchers working on language learning issues including both /empirical and experimental studies/ and /NLP-based applications/.

The *theme on Natural Language Processing (NLP) for Computer-Assisted Language Learning (NLP4CALL)* is a meeting place for researchers working on the integration of Natural Language Processing and Speech Technologies in CALL systems and exploring the theoretical and methodological issues arising in this connection.

The intersection of Natural Language Processing and Speech/Dialogue Technology with Computer-Assisted Language Learning (CALL) brings "understanding" of language to CALL tools, thus making CALL intelligent. This fact has given the name for this area of research: Intelligent CALL, ICALL. As the definition suggests, apart from having excellent knowledge of Natural Language Processing and/or Speech/Dialogue Technology, ICALL researchers need good insights into second language acquisition (SLA) theories and practices, as well as knowledge of second language pedagogy and didactics. This workshop therefore invites all ICALL-relevant research, including studies where NLP-enriched tools are used for testing SLA and pedagogical theories, and vice versa, where SLA theories/pedagogical practices are modeled in ICALL tools.

The *workshop on Natural Language Processing (NLP) for Research in Language Acquisition (NLP4LA)* broadens the scope of this year's joint workshop to also include theoretical, empirical, and experimental investigation of first, second and bilingual language acquisition.

We believe that this field will benefit from collaboration between the NLP, linguistics, psychology and cognitive science communities. The workshop is targeted at anyone interested in the relevance of computational techniques for first, second and bilingual language acquisition. Therefore, our aim is to bring together researchers from different fields with a shared interest in language acquisition.

The NLP4CALL&LA workshop series is aimed at bringing together competences from these areas for sharing experiences and brainstorming the future of the field.

*For the two tracks we welcome papers:*

* that describe research directly aimed at ICALL
* that demonstrate actual or discuss the potential use of existing Speech Technologies, NLP tools or resources for language learning
* that describe the ongoing development of resources and tools with potential usage in ICALL, either directly in interactive applications, or indirectly in materials, application or curriculum development, e.g. collecting and annotating ICALL-relevant corpora; developing tools and algorithms for readability analysis, selecting optimal corpus examples, etc.
* that discuss challenges and/or research agenda for ICALL
* that describe empirical studies on language learner data
* that describe computational models of first, second and bilingual language acquisition
* that describe empirical or experimental studies, or computational models of various aspects of language and their effect in language comprehension and acquisition
* that demonstrate actual or discuss the potential use of Speech Technologies, NLP tools or resources for investigating language
* that describe psycholinguistic and socio-linguistic investigations on first, second and bilingual language acquisition

We especially invite submissions describing the above-mentioned themes for the Nordic languages; and papers that focus on different age groups, cultures, and language variation. We are also interested in software demonstrations.

*Submission information*

We will be using the Nodalida 2017 template <http://stp.lingfil.uu.se/%7Ebea/nodalida17/> for the workshop this year. Authors are invited to submit long papers (8-12 pages) or, alternatively, short papers (4-7 pages), page count not including references. Only PDF files will be accepted. Submissions will be managed through the electronic conference management system EasyChair <https://spraakbanken.gu.se/eng/icall/joint6thNLP4CALL-2ndNLP4LA2017>. Final camera-ready versions of accepted papers will be given an additional page to address reviewer comments.

Papers should describe original unpublished work or work-in-progress. Every paper will be reviewed by at least 2 members of the program committee. As reviewing will be blind, please ensure that papers are anonymous. Self-references that reveal the author's identity, e.g., "We previously showed (Smith, 1991) ...", should be avoided. Instead, use citations such as "Smith previously showed (Smith, 1991) ...". Submissions will be judged on appropriateness, clarity, originality/innovativeness, correctness/soundness, meaningful comparison, significance and impact of ideas or results.

All accepted papers will be collected into a proceedings volume to be submitted for publication in the NEALT Proceedings Series (Linköping Electronic Conference Proceedings) and, additionally, double-published through the ACL Anthology, following experiences from previous workshops, e.g. the 3rd NLP4CALL <http://aclanthology.info/events/ws-2015#W15-19>.

*Important dates: *

* *13 January*, Friday: first call for papers

* *13 February*, Monday: second call for papers

* *6 March*, Monday: third call for papers

* *20 March*, Monday: paper submission deadline (short and long)

* *6 April*, Thursday: notification of acceptance

* *13 April*, Thursday: camera-ready papers for publication

* *4 May*, Thursday: call for participation

* *22 May*, Monday: workshop date

For further information, see workshop website:

https://spraakbanken.gu.se/eng/icall/joint6thNLP4CALL-2ndNLP4LA2017


Message: 2
Date: Mon, 16 Jan 2017 12:52:26 +0100
From: ELRA ELDA Information <info at elda.org>
Subject: [Corpora-List] IULA adopts ISLRN
To: destinataires inconnus: ;

/Press Release - Immediate/
/Paris, France, January 16, 2017/

*The Institute for Applied Linguistics (IULA) at the Universitat Pompeu Fabra (UPF) adopts the International Standard Language Resource Number (ISLRN) initiative*

The *Institute for Applied Linguistics (IULA) <https://www.iula.upf.edu/indexuk.htm>* at the *Universitat Pompeu Fabra (UPF)* <https://www.upf.edu/web/universitat> is now a certified provider to the ISLRN system. This means that IULA can apply for ISLRNs on behalf of the developers of the data that is managed and distributed via the IULA network. IULA has already submitted 107 language resources to the ISLRN. These include monolingual and multilingual written corpora and lexica for the languages spoken in Spain (Aragonese, Asturian, Basque, Castilian Spanish, Catalan, Galician), as well as other European languages (English, French, German, Greek, Italian, Occitan, Portuguese, Romanian), and Esperanto. The meta-information for these language resources is also available on the ISLRN website, where it reaches a broad international audience.

*Background*

As part of an international effort to document and archive the various language resource development efforts around the world, a system of assigning ISLRNs was established in November 2013. The ISLRN is a unique "persistent identifier" to be assigned to each language resource. The establishment of ISLRNs was a major step in the networked and shared world of human language technologies. Unique resources must be identified as they are, and meta-catalogues require a common identification format to manage data correctly. Therefore, language resources should carry identical identification schemes independent of their representations, whatever their types and wherever their physical locations (on hard drives, internet or intranet) (*http://islrn.org/*).

*About IULA*

The Institute for Applied Linguistics (IULA), created in 1994 at the Universitat Pompeu Fabra, is a research and training center that brings together about 50 researchers. The group IULA Technologies of Language Resources (TRL) works on the creation, development and application of technologies related to the acquisition, production, management, validation and evaluation of Language Resources as required by Language Technologies applications such as Machine Translation, Information Extraction, Text Analytics, etc. Resources produced by TRL are licensed under Creative Commons and are available for direct download at UPF's eRepository (*http://repositori.upf.edu/handle/10230/5963*).

To find out more about IULA and UPF, please visit the following websites: *http://www.iula.upf.edu/ *and *https://www.upf.edu/web/universitat*

*About ELRA*

The European Language Resources Association (ELRA) is a non-profit-making organisation founded by the European Commission in 1995, with the mission of providing a clearing house for language resources and promoting human language technologies. To find out more about ELRA, please visit the website: *http://www.elra.info <http://portal.elda.org/>*

Contact: info at elda.org



Message: 3
Date: Mon, 16 Jan 2017 12:36:01 +0100
From: Eva Maria Vecchi <evamariavecchi at gmail.com>
Subject: [Corpora-List] IWCS 2017 First Call for Papers
To: corpora at uib.no

12th INTERNATIONAL CONFERENCE ON COMPUTATIONAL SEMANTICS (IWCS)
University of Montpellier
19-22 September 2017
http://www.lirmm.fr/iwcs2017


Long and short paper submission deadline: 15 April 2017

The aim of the IWCS conference is to bring together researchers interested in the computation, annotation, extraction, and representation of meaning in natural language, whether from a lexical or structural semantic perspective. IWCS embraces both symbolic and statistical approaches to computational semantics, and everything in between.


The areas of interest for the conference include all computational aspects of meaning of natural language within written, spoken, or multimodal communication. Papers are invited on topics in these and closely related areas, including the following:

- representation of meaning
- syntax-semantics interface
- representing and resolving semantic ambiguity
- shallow and deep semantic processing and reasoning
- hybrid symbolic and statistical approaches to representing semantics
- alternative approaches to compositional semantics
- inference methods for computational semantics
- recognizing textual entailment
- deep learning and semantics
- learning by reading
- methodologies and practices for semantic annotation
- machine learning of semantic structures
- statistical semantics
- computational aspects of lexical semantics
- semantics and ontologies
- semantic web and natural language processing
- semantic aspects of language generation
- semantic relations in discourse and dialogue
- semantics and pragmatics of dialogue acts
- multimodal and grounded approaches to computing meaning
- semantics-pragmatics interface


Two types of submission are solicited:

Long papers:

- formatting instructions: http://www.lirmm.fr/iwcs2017/iwcs_instructions.html
- submission: https://easychair.org/conferences/?conf=iwcs2017
- deadline: 15 April 2017

Long papers should describe original research and must not exceed 10 pages, references excluded (see formatting instructions). The program committee may accept as a short paper a paper initially submitted as a long paper if the committee thinks it is better suited to that format. When fully accepted, long papers will be published in the conference proceedings and in the ACL Anthology, and will have a full oral presentation at the conference. When accepted as short papers they will be offered the same publication and presentation possibilities as accepted short papers.

Short papers:

- formatting instructions: http://www.lirmm.fr/iwcs2017/iwcs_instructions.html
- submission: https://easychair.org/conferences/?conf=iwcs2017
- deadline: 15 April 2017

Short papers are typically system descriptions with a demonstration, project descriptions, or reports on ongoing research, and must not exceed 5 pages, references excluded (see formatting instructions). Accepted short papers will be published in the conference proceedings and in the ACL Anthology, and will have a lightning talk at the conference, followed by a poster/demo session for discussion.

A REMARK ON DUAL SUBMISSION

Papers that have been or will be submitted to other meetings or publications must indicate this at submission time. Authors of papers accepted for presentation at IWCS 2017 must notify the program chairs by the camera-ready deadline as to whether the paper will be presented. All accepted papers must be presented at the conference to appear in the proceedings. We will not accept for publication or presentation papers that overlap significantly in content or results with papers that will be (or have been) published elsewhere.

Preprint servers such as arXiv.org or HAL and workshops without published proceedings are not considered as prior publications for this purpose. Authors should state in the online submission form the name of the workshop or preprint server and title of any such non-archival version, so that reviewers can be informed appropriately.

Please follow the information for authors for instructions on formatting your paper: http://www.lirmm.fr/iwcs2017/iwcs_instructions.html

Papers should be electronically submitted in PDF format via EasyChair: https://easychair.org/conferences/?conf=iwcs2017

There will be a separate call for workshops.

IWCS 2017 program committee:

- co-chair: Claire Gardent, CNRS LORIA Nancy
- co-chair: Christian Retoré, Université de Montpellier & LIRMM CNRS
- workshop chair: Richard Moot, CNRS LIRMM & Université de Montpellier
- hackathon chair: to be announced
- see web site for complete program committee: http://www.lirmm.fr/iwcs2017/iwcs_2017_program_committee.html


- 15 April 2017: submissions due (long and short)
- 13 July 2017: notification of acceptance
- 4 September 2017: final camera-ready papers due
- 19 September 2017: IWCS workshops
- 20-22 September 2017: IWCS main conference


Message: 4
Date: Mon, 16 Jan 2017 15:12:29 +0100
From: Alessandro Raganato <raganato at di.uniroma1.it>
Subject: [Corpora-List] Word Sense Disambiguation: a Unified Evaluation Framework and Empirical Comparison
To: corpora at uib.no

The unified evaluation framework for Word Sense Disambiguation (WSD) is available at http://lcl.uniroma1.it/wsdeval .

We have gathered together five popular all-words WSD evaluation datasets and two training datasets, standardizing their format and sense inventory to provide a unified evaluation framework. WSD is a long-standing task in Natural Language Processing, lying at the core of human language understanding. However, the field seems to be slowing down due to the lack of groundbreaking improvements. We argue that this is partly due to the lack of a standard benchmark, which prevents new approaches from being easily compared with old ones. Current benchmarks tend to differ in format, construction guidelines and underlying sense inventory.

In our work we used this framework to perform an empirical comparison among a set of heterogeneous approaches, including latest advances based on neural networks. All supervised approaches were trained on the same preprocessed corpora, ensuring a fair comparison among all systems. Additionally, we have enabled a competition in CodaLab <https://competitions.codalab.org/competitions/15984> for testing new models (or models not considered in our empirical comparison).

If you would like to contribute to the framework with sense-annotated training data or other evaluation datasets, you can share it with us (instructions on the website <http://lcl.uniroma1.it/wsdeval/share-your-data>).

Let's make WSD great again! :)

For more information, please read the reference paper:

Alessandro Raganato, Jose Camacho-Collados and Roberto Navigli.

Word Sense Disambiguation: A Unified Evaluation Framework and Empirical Comparison <http://lcl.uniroma1.it/wsdeval/data/EACL17_WSD_EvaluationFramework.pdf>. Proceedings of EACL 2017, Valencia, Spain

--
=====================================
Alessandro Raganato
Dipartimento di Informatica
Sapienza University of Rome
Viale Regina Elena 295
00161 Roma
Italy
Home Page: http://wwwusers.di.uniroma1.it/~raganato
=====================================


Message: 5
Date: Mon, 16 Jan 2017 19:37:43 +0100
From: Zeerak Waseem <zeerak.w at gmail.com>
Subject: [Corpora-List] Call for Papers: First Workshop on Abusive Language Online
To: corpora at uib.no

**********************************************************
ALW1: 1st Workshop on Abusive Language Online
to be held at ACL 2017 (Vancouver, Canada), August 3rd or 4th, 2017
Submission deadline: April 27, 2017
https://sites.google.com/site/abusivelanguageworkshop2017/
**********************************************************

Overview

The last few years have seen a surge in abusive online behavior, with governments, social media platforms, and individuals struggling to cope with the consequences. Online forums, comment sections, and social media interaction in general have become a playground of bullying, scapegoating, and hate speech. These forms of online aggression not only poison the social climate of the communities that experience it, but also lower the inhibition for direct physical violence, and increasingly even result in it.

As a field that directly works with computing over language, Natural Language Processing researchers are in a unique position to develop automated methods to analyze, detect, and filter abusive language. Additionally, we recognize that addressing abusive language is not solely the purview of NLP approaches but is a truly multi-disciplinary problem and thus requires knowledge from other fields, including but not limited to: psychology, sociology, law, gender studies, digital communication, and critical race theory.

In this one-day workshop, we aim to provide a space for researchers of various disciplines to meet and discuss approaches to abusive language. The workshop will include invited speakers and panelists from fields outside of NLP, as well as solicit papers from researchers across all areas. In addition, the workshop will host an "unshared task".

Paper Topics

We invite long and short papers on any of the following general topics:

- NLP models and methods for abusive language detection
- Application of NLP tools to analyze social media content and other large data sets
- NLP models for cross-lingual abusive language detection
- The social and personal consequences of being the target of abusive language and targeting others with abusive language
- Assessment of current non-NLP methods of addressing abusive language
- Legal ramifications of measures taken against abusive language use
- Best practices for using NLP techniques in watchdog settings
- Development of corpora and annotation guidelines

Unshared task

In order to encourage focused contributions, we invite researchers to treat the following list of data sets as an unshared task, in which participants can choose from the list to conduct their experiments. This list includes:

- Twitter Data Set [Waseem and Hovy, NAACL 2016]
- German Twitter Data Set [Ross et al., NLP4CMC 2016]
- Wikipedia Abusive Language Data Set [Wulczyn et al., preprint available here <https://arxiv.org/abs/1610.08914>]

Submission Information

We will be using the ACL 2017 Submission Guidelines and the START conference system to manage submissions: https://www.softconf.com/acl2017/alw/

Authors are invited to submit a full paper of up to 8 pages of content with up to 2 additional pages for references. We also invite short papers of up to 4 pages of content, with up to 2 additional pages for references. Accepted papers will be given an additional page of content to address reviewer comments. Authors of papers which describe systems are also invited to give a demo of their system. If you would like to present a demo in addition to presenting the paper, please make sure to select either "full paper + demo" or "short paper + demo" under "Submission Category" in the START submission page.

Previously published papers cannot be accepted. The submissions will be reviewed by the program committee. As reviewing will be blind, please ensure that papers are anonymous. Self-references that reveal the author's identity, e.g., "We previously showed (Smith, 1991) ...", should be avoided. Instead, use citations such as "Smith previously showed (Smith, 1991) ...".

We have also included a conflict-of-interest section in the submission form. You should mark all potential reviewers who have been authors on the paper, are from the same research group or institution, or who have seen versions of this paper or discussed it with you.

Organizing Committee

- Wendy Hui Kyong Chun, Brown University
- Dirk Hovy, University of Copenhagen
- Joel Tetreault, Grammarly
- Zeerak Waseem, University of Copenhagen

Program Committee/Reviewers

The following researchers have agreed to serve on the program committee as reviewers.

Swati Agarwal, IIIT Delhi, India
Fiona Barnett, Duke University, USA
Darina Benikova, University of Duisburg-Essen, LTL, Germany
Simone Browne, UT Austin, USA
Anneke Buffone, University of Pennsylvania, USA
Pete Burnap, Cardiff University, United Kingdom
Christina Capodilupo, Teachers College, Columbia University, USA
Guillermo Carbonell, University Duisburg-Essen, Germany
Pedro Cardoso, Synthesio, France
Gabriella Coleman, McGill, Canada
Bart Desmet, LT3, Ghent University, Belgium
Lucas Dixon, Jigsaw, USA
Nemanja Djuric, Uber ATC, USA
Jacob Eisenstein, Georgia Institute of Technology, USA
Hugo Jair Escalante, INAOE, Mexico
Lucie Flekova, UKP Lab, TU Darmstadt, Germany
Matthew Fuller, Goldsmith, UK
Lee Gillam, University of Surrey, United Kingdom
Jen Golbeck, University of Maryland, USA
Kevin Hamilton, University of Illinois, USA
Christopher Homan, Rochester Institute of Technology, USA
Veronique Hoste, Ghent University, Belgium
Els Lefever, LT3, Ghent University, Belgium
Shuhua Liu, Arcada University of Applied Sciences, Finland
Elizabeth Losh, College of William and Mary, USA
Shervin Malmasi, Harvard Medical School, USA
Fumito Masui, Kitami Institute of Technology, Japan
Yashar Mehdad, Airbnb, USA
Rada Mihalcea, University of Michigan, USA
Mainack Mondal, Max Planck Institute for Software Systems, Germany
Manuel Montes-y-Gómez, INAOE, Mexico
Kevin Munger, NYU, USA
Preslav Nakov, Qatar Computing Research Institute, HBKU, Qatar
Courtney Napoles, Johns Hopkins University, USA
Chikashi Nobata, Apple, USA
Guy De Pauw, CLiPS - University of Antwerp, Belgium
Whitney Phillips, Mercer University, USA
Karolien Poels, University of Antwerp, Belgium
Martin Potthast, Bauhaus-Universität Weimar, Germany
Daniel Preotiuc-Pietro, University of Pennsylvania, USA
Michal Ptaszynski, Kitami Institute of Technology, Japan
Awais Rashid, Lancaster University, United Kingdom
Björn Ross, University Duisburg-Essen, Germany
Paolo Rosso, Universitat Politecnica de Valencia, Spain
Masoud Rouhizadeh, Stony Brook University & University of Pennsylvania, USA
Molly Sauter, McGill University, Canada
Nishant Shah, Leuphana, ArtEZ University of the Arts, CIS (Bangalore), India
Thamar Solorio, University of Houston, USA
Jeffrey Sorensen, Jigsaw, USA
Achint Thomas, Embibe Indiavidual Inc, India
Lyle Ungar, University of Pennsylvania, USA
Anna Vartapetiance, University of Surrey, United Kingdom
Erik Velldal, University of Oslo, Norway
Ingmar Weber, Qatar Computing Research Institute, Qatar
Jacque Wernimont, Arizona State University, USA
Michael Wojatzki, University of Duisburg-Essen, Germany
Lilia Øvrelid, University of Oslo, Norway


Message: 6
Date: Mon, 16 Jan 2017 20:31:32 +0100
From: Marcos Zampieri <marcos.zampieri at uni-saarland.de>
Subject: [Corpora-List] Deadline Extended: Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial) at EACL 2017
To: corpora at uib.no, ln at cines.fr, sigir at acm.org, elsnet-list at elsnet.org

Deadline Extended: January 23, 2017

VarDial - Fourth Workshop on NLP for Similar Languages, Varieties and Dialects

VarDial 2017: http://ttg.uni-saarland.de/vardial2017/
Shared Tasks: http://ttg.uni-saarland.de/vardial2017/sharedtask2017.html

VarDial is co-located with EACL 2017. We anticipate discussion on computational methods and language resources for closely related languages and language varieties. Corpus-driven studies of different degrees of linguistic variation, for example in lexicon and grammar, are another topic of interest.

Within the scope of the workshop, we are organising an evaluation campaign with four shared tasks on similar languages, varieties and dialects. Check the website for more information.


We welcome papers dealing with one or more of the following topics:

- Language resources and tools for similar languages, varieties and dialects;
- Adaptation of tools (taggers, parsers) for similar languages, varieties and dialects;
- Evaluation of language resources and tools when applied to language varieties;
- Reusability of language resources in NLP applications (e.g., for machine translation, POS tagging, syntactic parsing, etc.);
- Corpus-driven studies in dialectology and language variation;
- Automatic identification of lexical variation;
- Automatic classification of language varieties;
- Text similarity and adaptation between language varieties;
- Linguistic issues in the adaptation of language resources and tools (e.g., semantic discrepancies, lexical gaps, false friends);
- Machine translation between closely related languages, language varieties and dialects.

Instructions for Authors

Submissions should be formatted according to the EACL template and submitted in PDF format. Papers should be at most 10 pages long (8 pages of content + 2 pages for references). The review process will be double-blind.

Important Dates

Submission Deadline: January 23, 2017 (extended)
Acceptance Notification: February 11, 2017
Camera-ready versions: February 21, 2017
Workshop: April 3, 2017

Program Committee

Željko Agić (IT University of Copenhagen, Denmark)
Cesar Aguilar (Pontifical Catholic University of Chile, Chile)
Laura Alonso y Alemany (University of Cordoba, Argentina)
Tim Baldwin (University of Melbourne, Australia)
Jorge Baptista (University of Algarve and INESC-ID, Portugal)
Eckhard Bick (University of Southern Denmark, Denmark)
Francis Bond (Nanyang Technological University, Singapore)
Aoife Cahill (Educational Testing Service, United States)
David Chiang (University of Notre Dame, United States)
Paul Cook (University of New Brunswick, Canada)
Marta Costa-Jussà (Universitat Politècnica de Catalunya, Spain)
Jon Dehdari (Saarland University and DFKI, Germany)
Liviu Dinu (University of Bucharest, Romania)
Stefanie Dipper (Ruhr University Bochum, Germany)
Sascha Diwersy (University of Montpellier, France)
Mark Dras (Macquarie University, Australia)
Tomaž Erjavec (Jožef Stefan Institute, Slovenia)
Mikel L. Forcada (Universitat d'Alacant, Spain)
Binyam Gebrekidan Gebre (Philips Research, Netherlands)
Cyril Goutte (National Research Council, Canada)
Nizar Habash (New York University Abu Dhabi, UAE)
Chu-Ren Huang (Hong Kong Polytechnic University, Hong Kong)
Jeremy Jancsary (Nuance Communications, Austria)
Lung-Hao Lee (National Taiwan Normal University, Taiwan)
Marco Lui (Rome2Rio Ltd., Australia)
Teresa Lynn (Dublin City University, Ireland)
John Nerbonne (University of Groningen, Netherlands and University of Freiburg, Germany)
Graham Neubig (Carnegie Mellon University, United States)
Kemal Oflazer (Carnegie Mellon University in Qatar, Qatar)
Maciej Ogrodniczuk (IPAN, Polish Academy of Sciences, Poland)
Petya Osenova (Bulgarian Academy of Sciences, Bulgaria)
Reinhard Rapp (University of Mainz, Germany and University of Aix-Marseille, France)
Paolo Rosso (Polytechnic University of Valencia, Spain)
Tanja Samardžić (University of Geneva, Switzerland)
Felipe Sánchez Martínez (Universitat d'Alacant, Spain)
Kevin Scannell (Saint Louis University, United States)
Yves Scherrer (University of Geneva, Switzerland)
Serge Sharoff (University of Leeds, United Kingdom)
Kiril Simov (Bulgarian Academy of Sciences, Bulgaria)
Milena Slavcheva (Bulgarian Academy of Sciences, Bulgaria)
Marco Tadić (University of Zagreb, Croatia)
Elke Teich (Saarland University, Germany)
Joel Tetreault (Grammarly, United States)
Francis Tyers (UiT Norgga árktalaš universitehta, Norway)
Duško Vitas (University of Belgrade, Serbia)
Taro Watanabe (Google Inc., Japan)
Pidong Wang (Machine Zone Inc., United States)


Preslav Nakov (Qatar Computing Research Institute, Qatar)
Marcos Zampieri (University of Cologne, Germany)
Nikola Ljubešić (Jožef Stefan Institute, Slovenia and University of Zagreb, Croatia)
Jörg Tiedemann (University of Helsinki, Finland)
Shervin Malmasi (Harvard Medical School, United States)
Ahmed Ali (Qatar Computing Research Institute, Qatar)

Contact: vardialworkshop at gmail.com

--
Marcos Zampieri
Saarland University
http://www.marcoszampieri.com


Message: 7
Date: Mon, 16 Jan 2017 23:57:26 +0000
From: NooJ 2017 <nooj2017 at gmail.com>
Subject: [Corpora-List] 2nd Call for papers: NooJ 2017 International Conference
To: CORPORA at uib.no

Second Call for Papers


NooJ 2017 International Conference

Kenitra, Morocco

May 18-20, 2017



Important dates:


Abstract submission: *17 January 2017*

Notification of acceptance: 15 March 2017

Registration: until 15 April 2017

Conference dates: 18-20 May, 2017


The Information Modeling and Communication System Laboratory (MISC), Faculty of Science, Ibn Tofail University, in cooperation with the International Agency for Natural Language Processing (IA4NLP), the ELLIADD laboratory of the Université de Franche-Comté, Besançon, and the NooJ association, is organizing the NooJ 2017 International Conference in Kenitra, Morocco, from 18 to 20 May 2017.

NooJ is both a corpus processing tool and a linguistic development environment. It allows linguists to formalize several levels of linguistic phenomena: orthography and spelling; lexicons of simple words; multiword units and frozen expressions; inflectional, derivational and productive morphology; and local, structural and transformational syntax. For each of these levels, NooJ provides linguists with one or more formal tools specifically designed to facilitate the description of each phenomenon, as well as parsing tools designed to be as computationally efficient as possible. This approach distinguishes NooJ from most computational linguistic tools, which provide a single formalism intended to describe every linguistic phenomenon. As a corpus processing tool, NooJ allows users to apply sophisticated linguistic queries to large corpora in order to build indices and concordances, annotate texts automatically, perform statistical analyses, etc.

NooJ is freely available and linguistic modules can already be freely downloaded for Acadian, Arabic, Armenian, Bulgarian, Catalan, Chinese, Croatian, French, English, German, Hebrew, Greek, Hungarian, Italian, Polish, Portuguese, Spanish and Turkish. A dozen other modules are under construction.


The conference intends to:


* give NooJ users and researchers in Linguistics and in Computational Linguistics the opportunity to meet and share their experience as developers, researchers and teachers;

* present to NooJ users the latest linguistic resources and NLP applications developed for/with NooJ, its latest functionalities, as well as its future developments;

* offer researchers and graduate students two tutorials (one basic and one advanced) to help them parse corpora and build NLP applications using NooJ;

* provide the occasion to present and discover the recent developments of NooJ itself (v5).


Topics of interest:


* Linguistic resources:

Typography, Spelling, Syllabification, Phonemic and Prosodic transcription, Morphology, Lexical Analysis, Local Syntax, Structural Syntax, Transformational Analysis, Paraphrase Generation, Formal Semantic Annotations and Analyses, Description of rare and endangered languages.

* Corpus processing:

Corpus Linguistics, Information extraction, Discourse Analysis, Business Intelligence, NLP applications, Applications of NooJ in Social Sciences.




We invite the submission of abstracts in English until 17 January 2017. Abstracts should contain the title, name, institution and email of the author(s). They should not exceed one page (between 300 and 600 words, references not included) and should be submitted via EasyChair at https://easychair.org/conferences/?conf=nooj2017. All proposals will be reviewed by the scientific committee, and authors will be notified of acceptance no later than 15 March 2017. Further information about the conference can be found at www.uit.ac.ma/nooj17. You can also contact the organizing committee at nooj17 at gmail.com for any additional information.


Scientific Committee:


* Xavier Blanco (Autonomous University of Barcelona, Spain)

* Mohammed El Hannach (Sidi Mohammed Ben Abdellah University, Morocco)

* Hela Fehri (University of Gabes, Tunisia)

* Yuras Hetsevich (United Institute of Informatics Problems, Belarus)

* Kristina Kocijan (University of Zagreb, Croatia)

* Svetla Koeva (University of Sofia, Bulgaria)

* Peter Machonis (Florida International University, USA)

* Samir Mbarki (Ibn Tofail University, Morocco)

* Slim Mesfar (University of Manouba, Tunisia)

* Mohammed Mourchid (Ibn Tofail University, Morocco)

* Mario Monteleone (University of Salerno, Italy)

* Johanna Monti (University of Sassari, Italy)

* Mohammed Outahajala (Institut Royal de la Culture Amazighe, Morocco)

* Jan Radimský (University of South Bohemia, Czech Republic)

* Azeddine Rhazi (Cadi Ayyad University, Morocco)

* Max Silberztein (Université de Franche-Comté, France)

* François Trouilleux (Université Blaise-Pascal, France)


Organizing Committee:


* Mohamed El Hannach (Sidi Mohamed Ben Abdellah University, Morocco)

* Abdelaaziz Mouloudi (Ibn Tofail University, Morocco)

* Mohamed Mourchid (Ibn Tofail University, Morocco)

* Samir Mbarki (Ibn Tofail University, Morocco)

* Mohamed Outahajala (Institut Royal de la Culture Amazighe, Morocco)

* Max Silberztein (Université de Franche-Comté, France)

* Slim Mesfar (University of Manouba, Tunisia)

* Héla Fehri (University of Gabes, Tunisia)


End of Corpora Digest, Vol 115, Issue 23 ****************************************
