[Corpora-List] Job: Linguistic Data Manager – Cambridge, MA

David Murgatroyd dmurga at yahoo.com
Thu Aug 9 12:44:01 CEST 2012


Linguistic Data Manager – Cambridge, MA Basis Technology is looking for a Linguistic Data Manager to play a key role in ensuring the high quality of our products. In this position, you will work with top notch players in the field of Natural Language Processing (NLP) technology to build models from large amounts of data in different languages and genres, through statistical analysis and machine learning. You will have the important task of facilitating the acquisition, evaluation and maintenance of this data, as well as overseeing others in the annotation of the data per our guidelines. Responsibilities

* Lead linguistic data management across the organization, including collection, annotation, documentation, dissemination, revision, evaluation and expiration

* Oversee annotation guidelines in collaboration with business, technical and annotation stakeholders

* Collaborate with product/research teams to define and fulfill linguistic data needs

* Join infrastructure team to define and evaluate tools for managing linguistic data (e.g., third-party annotation tools)

* Work with legal department to ensure compliance with data copyrights and licenses

* Hire and manage part-time multilingual staff of annotators, possibly including use of crowdsourcing (e.g., Amazon Mechanical Turk) Qualifications

* Master’s or Bachelor’s degree in language related discipline (e.g., linguistics, specific languages)

* 3-5 years related experience

* Record of success in coordinating team based projects

* Familiarity with command line scripting and version control systems

* Comfort with statistical measurements of data quality

* Ability to juggle multiple tasks, prioritize responsibilities and manage time effectively

* Excellent verbal and written communication skills How to Apply Please send cover letters, resumes and inquiries to: jobs-eng at basistech.com with “Linguistic Data Manager” in the subject line. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 9958 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20120809/aa4553a1/attachment.txt>

More information about the Corpora mailing list