[Corpora-List] 2 Post-doc positions in Natural Language Processing (France)

delphine.charlet at orange.com delphine.charlet at orange.com
Thu Feb 22 13:56:58 CET 2018


Orange Labs, the R&D division of Orange, offers 2 post-doc positions in Natural Language Processing, starting in April 2018. -Duration: 12 months, starting in April 2018, open until filled. -Location: Lannion, France -Contract: fixed term position -Remuneration: approx. 2,400€ /month net income (in addition, the contract includes health benefits, variable gratification and various corporate advantages)

Applicants will work in the Orange Labs NLP team, composed of ~20 people, which gather various skills, from research on several NLP suject to the development of actual industrial analytics tools. In the research domain, besides from feeding the development team with new algorithms, we collaborate with academic labs, through direct research contracts or trough collaborative projects (ongoing French ANR funded projects: PASTEL, DATCHA), and we participate to international evaluation challenges (in 2017: SemEval on Community Question Answering and CoNNL shared task on universal dependencies).

###################################### # Post-Doc 1: Multilingual Word Embeddings

##Topic Word Embeddings are now considered as the standard level of word representation in many tasks of Natural Language Processing. Trained in an unsupervised way, thus being able to take advantage of huge amount of textual data, they capture semantic and syntactic relations between words, and can be used in many supervised or unsupervised NLP tasks. Embeddings estimated on one language are not initially compatible with embeddings trained on another language. The goal of the post-doc is to study different solutions to build a unified representation space for different languages, where semantically related words from different languages will be close. Several recent research works have been devoted to bilingual word embeddings, and methods can be divided into 2 main approaches. One consists in training bilingual embeddings based on parallel corpora. The other attempts to map, in a shared space, embeddings which have been trained independently on different languages. This is the latter method which will be explored in the post-doc, due to the unavailability of parallel corpora in the applicative fields studied by Orange.

## Profile: PhD in computer sciences Familiarity with language technology Good knowledge of English and ability to integrate a French speaking working team.

## Contact Information: delphine.charlet at orange.com<mailto:delphine.charlet at orange.com>

##################################### # Post-Doc 2: Natural Language Generation

## Topic: Our team is also working on Question/Answering system based on structured knowledge database (RDF based) In order to output information, NLG techniques can be used to translate structured representations into natural language utterances. The main objective of the post-doc is to enable our research project team to rapidly develop competence on the subject of Natural Language Generation:

Presenting state of the art approaches and tools

Adapting and evolving existing tools

## Profile: PhD in Natural Langage Processing

profound knowledge in Natural Language Processing with a specialization in Natural Language Generation

experience in creating and modifying NLG software

ability to work in a team and to share experience with others

## Contact Information: johannes.heinecke at orange.com<mailto:johannes.heinecke at orange.com>

################################# ##Working Environment Situated on the beautiful north-west coast of Brittany (“Cote de Granit Rose”), Lannion is a small but vibrant city with a rich natural and historical heritage. It benefits from a strong local industrial network, hosting academics research institutes, big companies and start-ups. The Orange Labs campus in Lannion hosts one thousand engineers and researchers.

_________________________________________________________________________________________________________________________

Ce message et ses pieces jointes peuvent contenir des informations confidentielles ou privilegiees et ne doivent donc pas etre diffuses, exploites ou copies sans autorisation. Si vous avez recu ce message par erreur, veuillez le signaler a l'expediteur et le detruire ainsi que les pieces jointes. Les messages electroniques etant susceptibles d'alteration, Orange decline toute responsabilite si ce message a ete altere, deforme ou falsifie. Merci.

This message and its attachments may contain confidential or privileged information that may be protected by law; they should not be distributed, used or copied without authorisation. If you have received this email in error, please notify the sender and delete this message and its attachments. As emails may be altered, Orange is not liable for messages that have been modified, changed or falsified. Thank you.

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 22296 bytes Desc: not available URL: <https://www.uib.no/mailman/public/corpora/attachments/20180222/a7b7f485/attachment.txt>



More information about the Corpora mailing list