[Corpora-List] MomenT 2018 Workshop @ LREC: Multilingualism at the intersection of Knowledge Bases and Machine Translation

Mihael mihael.arcan at insight-centre.org
Mon Nov 6 12:29:40 CET 2017

/[apologies for cross-postings]/

1st Call for Papers: *Multilingualism at the intersection of Knowledge Bases and Machine Translation* (1st MomenT Workshop)

*Date: May 12th, 2018*. To be held as part of the 11th edition of the Language Resources and Evaluation Conference (LREC), at the Phoenix Seagaia Resort, Miyazaki, Japan.

Website: http://moment2018.insight-centre.org/

*Submission Deadline : January 12th, 2018*

*Workshop Description*

The increasing availability of knowledge bases, generated by researchers in the academia and industry, enables extensions of NLP applications with the vast amount of background knowledge available in open formats. However, most of the information that we find in knowledge bases, like knowledge graphs, ontologies, taxonomies, thesauri, dictionaries or terminological datasets, contain descriptions in natural language, mostly only in English, which severely limits their exploitation. Machine translation (MT) can be applied to automatically translate the terminological expressions into multiple languages, but not without its challenges. Usually, they contain domain-specific terminology, which is notoriously difficult to deal with since it is underrepresented in the MT training data.

On the other hand, growing attention has also been paid to the integration of domain-specific knowledge into MT systems or CAT tools. To reduce the post-editing effort involved in the translation process, approaches that enhance the MT systems with existing multilingual domain-specific knowledge, (e.g. DBpedia, IATE or ETB) are becoming increasingly popular. In this sense, the provision of multilingual knowledge bases that could be integrated better into the MT pipeline is a crucial step towards increasing the translation quality of highly-specific texts.

We believe that in the same way that domain ontologies and terminologies can benefit from NLP tools and MT systems that help to overcome the language barrier, NLP tools and MT systems would also profit from the specialized semantic knowledge and the terms that are captured in multilingual ontologies or terminologies. The workshop, therefore, aims at building a bridge between the knowledge base and machine translation communities.


This workshop aims to offer a forum for discussions to researchers who find themselves at the intersection of the two strands of research, namely, knowledge representations of high-specialized domains (represented by domain ontologies, terminologies, dictionaries) and MT, where the intersection point is precisely multilingualism. On the one hand, there is an impending need to cater for multilingual knowledge resources that support NLP, content analytics or information extraction across languages. On the other hand, there is a growing interest in exploiting the knowledge captured in such resources for MT systems that often lack the knowledge of domain-specific expressions.

We would like to bring together researchers from different communities as well as other interested stakeholders from the industry and the public sector in order to identify common interests, exploit synergies, provide use cases, and share methods, tools and resources. We are particularly interested to bring together researchers and practitioners working on MT, specifically statistical (SMT) or neural (NMT), on the one hand and experts on knowledge bases from various perspectives on the other, in order to enable cross fertilisation and foster the creation of innovative solutions that can only arise from interdisciplinary collaborations.

*Topics of Interest*

The workshop invites the submission of papers reporting on research on topics related to terminological/ontological resources processing in MT, including, but not limited to:

    Multilingual knowledge base generation with MT

    Injection of multilingual knowledge into MT

    Provenance of multiple multilingual resources for MT

    Terminological variations and MT

    MT used for cross-lingual information access, search and retrieval

    Neural MT applied to terminology translation

    Exploiting relational knowledge graphs for Neural MT

    Morphological variations of terminological expressions in MT

    Multilingual knowledge generation for under-resourced languages

    Enriching MT with Linked Data (BabelNet, DBpedia, UMLS, …)

    Multilingual terminology and linguistic resource development, integration and mapping in a Linked Data context

    Multilingual Semantic Web and Linked Data use cases

*Submission & Publication*

We accept submission of both long (up to 8 pages) and short papers (up to 4 pages) to be presented as long or short oral presentations at the workshop. The papers of the workshop will be published as online proceedings on the LREC 2018 website.

In addition, we aim for a journal special issue as post-conference proceedings in which a selected amount of papers presented at the workshop will be published.

When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a new result of your research.

For contact data, stylesheets, up-to-date details on submission and the workshop itself, please consult the workshop website (http://moment2018.insight-centre.org/).

_Identify, Describe and Share your LRs!_

Describing your LRs in the LRE Map is now a normal practice in the submission procedure of LREC (introduced in 2010 and adopted by other conferences). To continue the efforts initiated at LREC 2014 about “Sharing LRs” (data, tools, web-services, etc.), authors will have the possibility,  when submitting a paper, to upload LRs in a special LREC repository.  This effort of sharing LRs, linked to the LRE Map for their description, may become a new “regular” feature for conferences in our field, thus contributing to creating a common repository where everyone can deposit and share data.

As scientific work requires accurate citations of referenced work so as to allow the community to understand the whole context and also replicate the experiments conducted by other researchers, LREC 2018 endorses the need to uniquely Identify LRs through the use of the International Standard Language Resource Number (ISLRN, www.islrn.org), a Persistent Unique Identifier to be assigned to each Language Resource. The assignment of ISLRNs to LRs cited in LREC papers  will be offered at submission time.

*Important Dates:*

Paper submission deadline: January 12th 2018

Notification of Acceptance: February 9th 2018

Camera-ready paper: February 23rd 2018

Workshop date: May 12th 2018

*Organizing Committee:*

    Mihael Arcan, National University of Ireland, Galway, Ireland

    Elena Montiel-Ponsoda, Universidad Politécnica de Madrid, Spain

    Darja Fiser, University of Ljubljana, Jožef Stefan Institute, CLARIN ERIC

    Tatjana Gornostaja, Tilde, Latvia, ELRA, BDVA

For further information please contact: mihael.arcan at insight-centre.org

The workshop is supported by the CLARIN research infrastructure.

*Programme Committee:*

Guadalupe Aguado-de-Cea - Universidad Politécnica de Madrid, Spain Paul Buitelaar - National University of Ireland, Galway, Ireland Philipp Cimiano - Bielefeld University, Germany Christian Chiarcos - Goethe-Universität, Germany Béatrice Daille - University of Nantes, France Brian Davis - Maynooth University, Ireland Thierry Declerck - German Research Center for Artificial Intelligence (DFKI), Germany Arantza Diaz de Ilarraza Sanchez - University of the Basque Country, Spain Mauro Dragoni - Fondazione Bruno Kessler (FBK), Italy Tomaž Erjavec - Jožef Stefan Institute, Slovenia Miquel Esplà-Gomi - Universitat d'Alacant, Spain Gema Ramírez - Prompsit Language Engineering, Spain Natalia Grabar - Université de Lille, France Jorge Gracia - Universidad Politécnica de Madrid, Spain Miloš Jakubiček - University in Brno / Lexical Computing Limited, Czech Republic John Judge - Dublin City University, Ireland Ilan Kernerman - K Dictionaries, Israel Iztok Kosem - Trojina, Institute for Applied Slovene Studies, Slovenia Simon Krek - Jožef Stefan Institute, Slovenia Els Lefever - Ghent University, Belgium Qun Liu - Dublin City University, Ireland & Chinese Academy of Sciences, China John P. McCrae - National University of Ireland, Galway, Ireland Roberto Navigli - Sapienza University of Rome, Italy Mārcis Pinnis - Tilde, Latvia Laurette Pretorius - University of South Africa (UNISA), South Africa Georg Rehm - German Research Center for Artificial Intelligence (DFKI), Germany Antonio Toral - University of Groningen, Netherlands Marco Turchi - Fondazione Bruno Kessler (FBK), Italy Špela Vintar - University of Ljubljana, Slovenia -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 11099 bytes Desc: not available URL: <https://www.uib.no/mailman/public/corpora/attachments/20171106/ac5a3223/attachment.txt>

More information about the Corpora mailing list