[Corpora-List] Job: NLP Engineer / Computer Scientist at Springer (Online) Book Archives, Heidelberg, Germany

Christian Chiarcos christian.chiarcos at web.de
Tue Dec 4 01:45:47 CET 2012


Dear list members,

I was asked to forward the following job announcement. Apologies for cross-posting.

Christian Chiarcos --

Job description
Springer (Online) Book Archives
Professionals, Fulltime, Mid-/Senior level, IT, NLP, Publishing
Springer, Heidelberg (Frankfurt Area)

Introducing ourselves: With 63 publishing companies in 25 countries in Europe, Asia and the USA, Springer Science+Business Media is now one of the world's leading publishers of specialist knowledge and information. Our publishing competence and our long tradition provide the ideal conditions for high-quality content.

Job duties:

To further increase the value of its book product portfolio, Springer has decided to digitize all books it has published since 1842. For this project we are searching for a data professional who will be responsible for the overall process chain, from the identification of archive titles and digital content creation through to online publication and the availability of data for print-on-demand purposes. If you are looking for a task that will make a difference for both science and society, are interested in applying your expertise to large, same-domain text corpora, and are excited to be part of this first-of-its-kind project, this job is for you.

The scope of tasks involves developing hands-on solutions for large data sets (bibliographic metadata including author information, text data, and data formats such as PDF and XML), with future integration into Springer’s IT system landscape. With entries numbering in the hundreds of thousands, data input can be both structured and unstructured, and may or may not be integrated into the existing system landscape. The candidate should have a proven track record in writing code in a variety of languages and environments. He or she should be able to understand the business requirements and be willing to take ultimate responsibility for both long-term solutions and short-term, fast, pragmatic ones.

Job requirements:

• M.S. or Ph.D. in Computer Science, Math, Engineering, or equivalent
• Background in Machine Learning, Text Mining or Natural Language Processing (NLP)
• Proven expertise (5 years or more) in database programming, SQL, scripting languages (e.g. PHP, Python), XML
• Experience in automatic processing of scientific publications or related domains
• Understanding of bibliographic metadata is of benefit
• Industry experience is a plus
• Pragmatic and priority-oriented, result and quality oriented mind-set is a must
• Ability to solve problems and deliver on time
• Excellent communication skills and ability to work with other teams in a global environment
• Fluency in English (essential) and German (preferred)

For further details and to apply please see https://springer-career.becruiter.net//jobagent/search/default.aspx?design=_std&rowguid=c66f825a-ebab-4371-94c0-f0a0ccdbf9e2


