[Corpora-List] PhD Scholarship in Language Technology at the University of Copenhagen

Bolette S Pedersen bspedersen at hum.ku.dk
Fri Jun 19 12:22:00 CEST 2009


Co-financed PhD Scholarship in Language Technology

at the University of Copenhagen

The Centre for Language Technology, under the Faculty of Humanities at the University of Copenhagen, is inviting applications for a PhD scholarship in Language Technology starting 1 February, 2010 for a period of up to three years. The PhD scholar will be jointly associated with the Centre for Language Technology and the Department of Computer Science, both at the University of Copenhagen.

*Automatic knowledge extraction from linguistic corpora *The PhD scholarship is announced within the interdisciplinary research area of Information Technology and Language, which combines knowledge in computer science with knowledge of linguistics. Large collections of written or spoken corpora are currently being developed for a large group of languages, including Danish. Tagged corpora make it possible to train machine learning algorithms and other statistical models which are able to extract different kinds of knowledge from language data.

This knowledge may concern syntactic or semantic information at word level, for example parts of speech or meaning in a given context. Or it can relate to syntactic structure - how to assign constituent or dependency structure to phrases. Finally, it may concern the content of written or spoken corpora, which is generally defined and represented in very different ways depending on the model and the purpose.

The PhD project should focus on one or more of these aspects of knowledge extraction, and the PhD scholar should relate to relevant language technology applications, for instance development of lexical resources (computational lexicons and wordnets), grammar in relation to parsing or statistically-based machine translation, knowledge extraction for expert systems or training of response types in adaptive dialogue systems.

For further information see http://www.humanities.ku.dk/research/PhD/Announcements/languagetechnology/

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 2860 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20090619/330dc9c8/attachment.txt>



More information about the Corpora mailing list