[Corpora-List] Job: Fixed term Software Engineer for Named Entities Extraction

Gaël de Chalendar gael.de-chalendar at cea.fr
Fri May 4 16:58:28 CEST 2018

** CEA LIST / LVIC, Palaiseau, Île-de-France, France **

** Software Engineer position. 12-month renewable contract. **



CEA LIST is a technological research institute specialized in the design and development of complex and software-intensive systems (http://www-list.cea.fr).

Within the LIST Institute, the Vision and Content Engineering Laboratory (LVIC) employs around 50 researchers and engineers working on the analysis and interpretation of multimedia data (text, image and video analysis). In the field of Artificial Intelligence, the laboratory develops robust algorithms for extracting, analyzing and processing large volumes of multimedia data. Our technologies have contributed to the emergence of new economic activities through the creation of startups such as ANT'Inno. In addition, the laboratory participates in numerous collaborative projects (ANR, Europe, Competitiveness Clusters) with academic partners, SMEs or major companies.

The laboratory has developed a multilingual linguistic analyzer named LIMA which was released under a Free license in 2014 (https://github.com/aymara/ lima). For the extraction of named entities, LIMA uses traditional technologies based on linguistic resources and rules and learning technologies with CRF models and neural network-based models (bi-LSTM). However, the language resources of the standard approach lack a regular update and the entities extracted by the different methods are not correlated. In this context, the laboratory is looking for a collaborator to work on these two aspects.

** Missions **

In this context, the work of the collaborator will consist of: - exploit online data such as Wikipedia or Wikidata to semi-automatically update resources for Named Entities Extraction; - develop a named entities merging algorithm that will merge results from the various methods.

**Required profile**

Computer engineer with Natural Language Processing (NLP) orientation.

The candidate will be proficient in C++ under GNU/Linux and Microsoft Windows. Knowledge of deep learning frameworks such as TensorFlow, Caffé2, etc. would be a plus. Required skills also include mastery of Python. The notions of continuous integration will have to be known. A good knowledge of NLP is essential.

Professional qualities: open-mindedness and curiosity, analytical and synthesis skills, ability to work in a team, strong motivation for research and innovation.

Compensation according to training and experience.


NanoInnov integration center (Saclay plateau, near Polytechnique)

** Time **

12 months renewable


Applications (CV + cover letter) should be sent to: Romaric Besançon <mailto:romaric.besancon at cea.fr> Gaël de Chalendar <mailto:gael.de-chalendar at cea.fr>

-- Gael de Chalendar CEA LIST Laboratoire Vision et Ingénierie des Contenus (Vision and Content Engineering Laboratory)

CEA SACLAY - NANO INNOV BAT. 861 Point courier 173 91191 GIF SUR YVETTE

Tél.:+ Fax:+ Email : Gael.D.O.T.de-Chalendar.A at T.cea.D.O.T.fr

-------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/pkcs7-signature Size: 3493 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20180504/e2400077/attachment-0001.p7s>

More information about the Corpora mailing list