The Fifth Workshop on Hybrid Approaches to Translation (HyTra-5), in conjunction with EAMT 2016, intends to continue enriching and deepening the research agenda in the area of Hybrid MT. In the last four editions, prestigious invited speakers, full paper and poster sessions have allowed for fruitful, creative and interdisciplinary discussions, comparing and contrasting diverse ways of integrating different MT paradigms to moving forward the state-of-the-art in Machine Translation. In addition to the proceedings, a first contributed volume, composed of papers selected from the first three editions of HyTra, is currently undergoing its final editorial review and will be soon published by Springer.
HyTra motivates the cooperation and interaction between the different groups involved in this highly interdisciplinary and multidisciplinary field, namely translators, engineers, computer scientists, mathematicians and linguists, as well as the fostering of innovation and creativity in the Hybrid Machine Translation research community.
Given the complementarity and mutual attractiveness of data-driven and rule-based MT, it is natural that the boundaries among them have narrowed. The question is what the combined architecture or architectures should look like. In the past few years, in the MT community, the interest in hybridization and system combination has significantly increased.
We solicit contributions including but not limited to the following topics:
· ways and techniques of hybridization
· architectures for the rapid development of hybrid MT systems
· applications of hybrid systems
· hybrid systems dealing with under-resourced languages
· hybrid systems dealing with morphologically rich languages
· using linguistic information (morphology, syntax, semantics) to enhance statistical MT (e.g. with hierarchical or factored models)
· bootstrapping rule-based systems from corpora
· extraction of dictionaries from parallel and comparable corpora
· induction of morphological, grammatical, and translation rules from corpora
· improving MT with statistical and rule-based computational linguistics methods (word sense disambiguation, information extraction, terminology mining, metaphor recognition, etc.)
· machine learning techniques for hybrid MT and complex data structures
· describing and using structural mappings between languages (e.g. tree-structures using synchronous/transduction grammars)
· system combination approaches such as multi-engine MT (parallel) or automatic post-editing (sequential)
· hybrid methods in spoken language translation
· heuristics for limiting the search space in hybrid MT
· translation of user generated contents
· new methods for the fair evaluation of the output of different types of MT systems (e.g. relying on linguistic criteria)
· use of word embeddings and continuous vector space representations in hybrid MT
· neural networks, deep learning and neural MT hybridization
· use and supply of open source tools and free language resources for hybrid MT
· presentations of industrial hybrid MT systems and technologies which involve hybrid MT systems in commercial and professional applications
Contributions can be short or long papers in the format of the workshop.
*Special Call for Shared Task Proposals*
We solicit proposals for shared tasks relevant to hybrid translation with the potential to be conducted in future editions of the HyTra workshop series. Proposals should include:
1) A definition of the objectives of the shared task (e.g. user generated content translation). 2) A suggestion of a baseline system (if appropriate). 3) Data to conduct the shared task. 4) An evaluation measure
Proposals should be different from those conducted elsewhere. We particularly welcome proposals which motivate the MT industry to participate. The proposals should be 2 pages long in the format required by the workshop.
*Paper submission* April 26, 2016 *Notification to authors* May 5, 2016 *Camera-ready deadline* May 9, 2016 *Conference* June 2, 2016
*Submission Information *
Authors are invited to submit papers on original and previously unpublished work. Reviewing of papers will be double-blind, so the submissions should not reveal the authors’ identity. Papers should be no longer than 12 pages (A4 size), in PDF format. Papers must conform to the format defined by the BJMC template <http://www.bjmc.lu.lv/instructions-to-authors/>.
Submission is electronic in PDF format using the Easychair submission system at the following url: https://easychair.org/conferences/?conf=hytra5
Double submission policy: Parallel submission to other meetings or publications are possible but must be immediately notified to the workshop contact person (see below). If accepted, withdrawals are only possible within two days after notification.
For an accepted paper to appear in the proceedings, at least one author must register for the workshop and actually present the paper. The papers will be published in the workshop proceedings which will be made available online, via the CEUR publication service.
*Program Committee *
· Arianna Bisazza, University of Amsterdam, The Netherlands · Bogdan Babych, University of Leeds, UK · Rafael E. Banchs, Institute for Infocomm Research, Singapore · Alexey Baytin, Yandex, Moscow, Russia · Pierrette Bouillon, ISSCO/TIM/ETI, University of Geneva, Switzerland · Marta R. Costa-jussa, UPC, Barcelona · Josep Maria Crego, Systran, Paris, France · Kurt Eberle, Lingenio GmbH, Heidelberg, Germany · Cristina España, UPC, Barcelona · Christian Federmann, Microsoft Research, Seattle, USA · José A. R. Fonollosa, UPC, Barcelona · Maxim Khalilov, Berlin, Germany · Udo Kruschwitz, University of Essex, UK · Patrik Lambert, Pompeu Fabra University, Barcelona, Spain · Maite Melero, Pompeu Fabra University, Barcelona, Spain · Reinhard Rapp, Universities of Aix-Marseille, France, and Mainz, Germany · George Tambouratzis, Institute for Language and Speech Processing, Greece · Jörg Tiedemann, University of Uppsala, Sweden · Grigori Sidorov, Instituto Politécnico Nacional, Mexico *Organizing Committee*
Marta R. Costa-jussà, Rafael E. Banchs, Reinhard Rapp, Patrik, Lambert, Kurt Eberle, Bogdan Babych -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 18916 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20160418/9b17c7c8/attachment.txt>