[Corpora-List] Final CFP: 2nd WAC Workshop, at EACL

Adam Kilgarriff adam at lexmasterclass.com
Fri Jan 6 10:51:01 CET 2006

Final Call for Papers:

In conjunction with the 11th Conference of the European Chapter of the
Association for Computational Linguistics (EACL)

Trento, Italy
April 4, 2006

Submissions by 6 Jan 2006 at

Workshop site:

Previous WaC Workshop:

Co-chairs: Adam Kilgarriff and Marco Baroni


Research on the Web as corpus is currently at a very exciting stage:
increasing evidence points to the enormous potential of the Internet as a
source of linguistic data, but we are still far from a working,
linguists' search engine. Many fundamental issues are just starting to be
tackled, ranging from Web frequency distributions and registers, to
efficient handling of massive data sets, to copyright.

We invite submissions which:

- describe Web corpus collection projects, or modules for one part of
the process (crawling, filtering, language-id, tokenizing,
lemmatizing, POS-tagging, indexing, ...)

- explore characteristics of Web data, from a linguistics/NLP

- use crawled Web data for NLP purposes.

Preference will be given to projects where Web data are downloaded and
processed directly, rather than via search engine interfaces.

Submission Information

Authors are invited to submit full papers on original, unpublished
work in the topic area of this workshop. Submissions should follow the
two-column format of ACL proceedings and should not exceed eight (8)
pages, including references. We strongly recommend the use of ACL
LaTeX or Microsoft Word style files tailored for this year's
conference available at


Papers must conform to the official EACL-06 style guidelines, and we
reserve the right to reject submissions that do not conform to these
styles, including font size restrictions. Submissions should be in PDF
format and must include all fonts, so that the paper will print (not
just view) anywhere.

Please submit your paper no later than January 6, 2006, at

Each submission will be reviewed at least by two members of the
program committee. Accepted papers will be published in the workshop

Dual submissions to the main EACL 2006 conference and this workshop
are allowed; if you submit to the main session, do indicate this when
you submit to the workshop, and specify your EACL submission reference
number, for administrative ease. If your paper is accepted for the
main session, you should withdraw your paper from the workshop upon
notification by the main session.

Important Dates

January 6, 2006 - Deadline for workshop papers

January 27, 2006 - Notification of acceptance

February 10, 2006 - Camera-ready papers due

April 4, 2006 - Workshop

Program Committee

Marco Baroni (co-chair)
Silvia Bernardini
Massimiliano Ciaramita
Stefan Evert
William H. Fletcher
Gregory Grefenstette
Frank Keller
Adam Kilgarriff (co-chair)
Mirella Lapata
Anke Lüdeling
Philip Resnik
Serge Sharoff


Adam Kilgarriff: adam_AT_lexmasterclass.com
Marco Baroni: baroni_AT_sslmit.unibo.it

More information about the Corpora-archive mailing list