[Corpora-List] Extended Deadline (3 March): Language Technology for Cultural Heritage Data (LaTeCH-08)

Caroline Sporleder csporled at CoLi.Uni-SB.DE
Mon Feb 18 18:34:42 CET 2008


Due to several requests we have extended the deadline for LaTeCH-08 and we now also invite short papers and demos.

*********************************************************

CALL FOR PAPERS

***DEADLINE EXTENSION AND INVITATION FOR SHORT PAPERS/DEMOS***

LREC 2008 Workshop on

Language Technology for Cultural Heritage Data

(LaTeCH 2008)

Special Theme:

"Resources and Tools for Studying Language Variety and Change"

1 June, 2008, Marrakech, Morocco

http://ilk.uvt.nl/latech08/

*Extended* submission deadline: *3 March 2008*

*********************************************************

The Second Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2008) will be held in conjunction with LREC 2008, and will take place on June 1 in Marrakech, Morocco.

* SCOPE AND TOPICS

Museums, archives, and libraries around the world maintain large collections of cultural and scientific heritage objects, such as archaeological artefacts, audio and video recordings, manuscripts, archival documents, and other written sources. Such collections are a potentially very valuable resource for specialists and laypersons alike, provided they can be easily accessed and automatically processed. Furthermore, textual cultural heritage resources, such as old manuscripts and early printed books, are not only interesting for their information content, but are also an invaluable source for linguistic research on diachronic and synchronic language variety and change. While several large-scale digitisation projects are currently underway to make cultural heritage resources more accessible, it is equally important to develop powerful tools to search, link, enrich, and mine the digitised data. Language technology has a crucial role to play in this, even for collections which are primarily non-textual, since text is the pervasive medium used for meta-data. At the same time, the cultural heritage domain poses special challenges for the NLP community, including the use of historic or non-standard language, the presence of OCR or transcription errors in the data, and the necessity to deal with data from various media.

For LaTeCH 2008, we invite papers on language technology for cultural heritage data in general and on the special theme of "Studying Language Variety and Change". Topics of interest include, but are not limited to, the following:

- enriching cultural heritage data by inducing meta-data

- dealing with linguistic variation and non-standard or non-contemporary use of language

- automatic error detection and cleaning

- adapting existing NLP tools for the cultural heritage domain

- linking and retrieving information from different sources, media, and languages

- representing cultural heritage data to different audiences (personalisation, text simplification, text summarisation, text generation from databases, hypertext generation)

- knowledge discovery in cultural heritage data

- complex annotation tools

- determination of word and sentence boundaries within manuscripts

- resources for and treatment of dialects (general solutions)

- annotations of language variety on the orthographic, morphological, and syntactic level

- global language resource management systems

- repositories of cultural and scientific heritage data

* SUBMISSIONS

Authors are invited to submit papers on original, unpublished work in the topic area of this workshop. In addition to long papers presenting completed work, we also invite short papers and demos:

- Long papers should present completed work and should not exceed 8 pages.

- Short papers/demos can present work in progress or the description of a system, and should not exceed 5 pages.

All submissions should be formatted using the LREC 2007 stylefiles, which are available at:

http://www.lrec-conf.org/lrec2008/Download-Templates.html

The reviewing process will be blind; papers should not include the authors' names and affiliations, or any references to web sites, project names, etc. that reveal the authors' identity. Each submission will be reviewed by at least two members of the program committee. Accepted papers will be published in the workshop proceedings.

Papers should be submitted electronically, no later than March 3, 2008 via the following website: https://www.softconf.com/LREC2008/LaTeCH2008/submit.html

The only accepted format for submitted papers is Adobe PDF.

* IMPORTANT DATES

3 March - Extended deadline for workshop papers

28 March - Notification of acceptance

9 April - Camera-ready papers due

1 June - Workshop held at LREC 2008

* PROGRAM COMMITTEE

Ion Androutsopoulos, Athens University of Economics and Business, Greece
Timothy Baldwin, University of Melbourne, Australia
David Bamman, Perseus, USA
David Birnbaum, University of Pittsburgh, USA
Antal van den Bosch, Tilburg University, The Netherlands
Andrea Bozzi, ILC-CNR, Pisa, Italy
Kate Byrne, University of Edinburgh, Scotland
Paul Clough, Sheffield University, UK
Greg Crane, Perseus, USA
Vania Dimitrova, University of Leeds, UK
Milena P. Dobreva, HATII, University of Glasgow, Scotland
Mick O'Donnell, Universidad Autonoma de Madrid, Spain
Matthew James Driscoll, Kobenhavns Universitet, Denmark
Franciska de Jong, University of Twente, The Netherlands
Claire Grover, University of Edinburgh, Scotland
Ben Hachey, University of Edinburgh, Scotland
Djoerd Hiemstra, University of Twente, The Netherlands
Dolores Iorizzo, Imperial College London, UK
Christer Johansson, University of Bergen, Norway
Jaap Kamps, University of Amsterdam, The Netherlands
Vangelis Karkaletsis, NCSR "Demokritos", Greece
Piroska Lendvai, Tilburg University, The Netherlands
Anke Luedeling, Humboldt-Universitaet, Germany
Roland Meyer, University of Regensburg, Germany
Maria Milosavljevic, University of Edinburgh, Scotland
Marie-Francine Moens, Katholieke Universiteit Leuven, Belgium
John Nerbonne, Rijksuniversiteit Groningen, The Netherlands
Marco Passarotti, Universita Cattolica del Sacro Cuore, Italy
Martin Reynaert, Tilburg University, The Netherlands
Kiril Ribarov, Charles University, Czech Republic
Maarten de Rijke, University of Amsterdam, The Netherlands
Peter Robinson, ITSEE, UK
Maria Simi, University of Pisa, Italy
Caroline Sporleder, Saarland University, Germany

* ORGANISING COMMITTEE

Caroline Sporleder (Co-Chair), Saarland University, Germany
Kiril Ribarov (Co-Chair), Charles University, Czech Republic
Antal van den Bosch, Tilburg University, The Netherlands
Milena P. Dobreva, HATII, University of Glasgow, Scotland
Matthew James Driscoll, Kobenhavns Universitet, Denmark
Claire Grover, University of Edinburgh, Scotland
Piroska Lendvai, Tilburg University, The Netherlands
Anke Luedeling, Humboldt-Universitaet, Germany
Marco Passarotti, Universita Cattolica del Sacro Cuore, Italy

* FURTHER INFORMATION

Workshop web page: http://ilk.uvt.nl/latech08/

LREC 2008 web page: http://www.lrec-conf.org/lrec2008/

* CONTACT INFORMATION

Caroline Sporleder, csporled (at) coli.uni-sb.de
Kiril Ribarov, ribarov (at) ufal.mff.cuni.cz

--
---------------------------------------------------------
Caroline Sporleder
Computational Linguistics, Saarland University
csporled at coli.uni-sb.de
http://www.coli.uni-saarland.de/~csporled
---------------------------------------------------------


