Parliamentary data is a major source of socially relevant content. It is available in ever larger quantities, is multilingual, accompanied by rich metadata, and has the distinguishing characteristic that it is spoken language produced in controlled circumstances which has traditionally been transcribed but is now increasingly released also in audio and video formats. All these factors require solutions related to structuring, synchronization, visualization, querying and analysis of parliamentary corpora. Furthermore, approaches to the exploitation of parliamentary corpora to their full extent also have to take into account the needs of researchers from vastly different Humanities and Social Sciences fields, such as political sciences, sociology, history, and psychology.
A successful first edition of the ParlaCLARIN scientific workshop held at LREC 2018 (https://www.clarin.eu/ParlaCLARIN) and a follow-up developmental ParlaFormat workshop held by CLARIN ERIC in 2019 (https://www.clarin.eu/event/2019/parlaformat-workshop) resulted in a good overview of the multitude of the existing parliamentary resources worldwide as well as tangible first steps towards better harmonization, interoperability and comparability of the resources and tools relevant for the study of parliamentary discussions and decisions.
The second ParlaCLARIN workshop therefore aims to bring together developers, curators and researchers of regional, national and international parliamentary debates that are suitable for research in disciplines in the Humanities and Social Sciences. We invite unpublished original work focusing on the compilation, annotation, visualisation and utilisation of parliamentary records as well as linking or comparing parliamentary records with other datasets of political discourse such as party manifestos, political speeches, political campaign debates, social media posts, etc. Apart from dissemination of the results, the workshop also aims to address the identified obstacles, discuss open issues and coordinate future efforts in this increasingly trans-national and cross-disciplinary community.
Due to the Freedom of Information Acts that are supported by the United Nations and set in place in over 100 countries worldwide, parliamentary debates are being increasingly easy to obtain, and have always been of interest to researchers from a wide range fields in Humanities and Social Sciences both for the potential influence of their content, and the specificities of the formalized, often persuasive and emotional language use in this context. As a consequence, there are many initiatives, on the national and international levels, that aim at compiling and analysing parliamentary data. The recent CLARIN-PLUS survey on parliamentary data has identified over 20 corpora of parliamentary records, with over half of them being available within the CLARIN infrastructure (https://www.clarin.eu/resource-families/parliamentary-corpora).
Given the maturity, variety, and potential of this type of language data as well as the rich metadata it is complemented with, it is urgent to gather researchers both from the side of those producing parliamentary corpora and making them available, those making use of them for linguistic, historical, political, sociological etc. research as well as those linking or comparing them with other datasets of political discourse such as party manifestos, political speeches, political campaign debates, social media posts, etc. in order to share methods and approaches of compiling, annotating and exploring parliamentary and other political language data in order to achieve harmonization of the compiled resources, and to ensure current and future comparability of research on national datasets as well as promote transnational analyses.
Topics of interest
Topics include but are not limited to: - Creation and annotation of parliamentary data in textual, spoken and video format - Annotation standards and best practices for parliamentary corpora - Accessibility, querying and visualisation of parliamentary data - Text analytics, semantic processing and linking of parliamentary and other datasets of political language data - Parliamentary corpora and multilinguality - Studies based on parliamentary corpora - Studies comparing parliamentary corpora with other types of political discourse
Submission & Publication
We accept submission of long papers (up to 8 pages), short papers (up to 4 pages) and demo papers (up to 4 pages) to be presented as a long or short oral presentation at the workshop. The papers of the workshop will be published in online proceedings.
When submitting a paper from the START page, authors will be asked to provide essential information about resources (in a broad sense, i.e. also technologies, standards, evaluation kits, etc.) that have been used for the work described in the paper or are a result of your research. Moreover, ELRA encourages all LREC authors to share the described LRs (data, tools, services, etc.) to enable their reuse and replicability of experiments (including evaluation ones). For contact data, stylesheets, up-to-date details on submission and the workshop itself, please consult the workshop website.
Submission page: will be communicated by 20 December 2019
- Paper submission deadline: 14 February 2020 - Notification of acceptance: 13 March 2020 - Camera-ready paper: 2 April 2020 - Workshop date: 12 May 2020
- Darja Fišer, University of Ljubljana and Jožef Stefan Institute - Franciska de Jong, CLARIN ERIC - Maria Eskevich, CLARIN ERIC
The workshop is supported by the CLARIN research infrastructure. To contact the organizers, please mail clarin at clarin.eu<mailto:clarin at clarin.eu> (Subject: [ParlaCLARIN at LREC2020]).
Programme Committee (in alphabetical order)
Bente Maegaard, University of Copenhagen, Denmark Francesca Frontini, Université Paul Valéry - Montpellier, France Henk van den Heuvel, Radboud University, The Netherlands Jan Odijk, Utrecht University, The Netherlands Kaspar Beelen, The Alan Turing Institute, UK Klaus Illmayer, Austrian Academy of Sciences, Austria Laura Morales, Sciences Po, France Maciej Ogrodniczuk, Institute of Computer Science, Polish Academy of Sciences, Poland Maria Gavriilidou, ILSP/Athena RC, Greece Maria Pontiki, ILSP/Athena RC, Greece Monica Monachini, National Research Council of Italy, Italy Petya Osenova, IICT-BAS and Sofia University "St. Kl. Ohridski", Bulgaria Sara Tonelli, Fondazione Bruno Kessler, Italy Simone Paolo Ponzetto, University of Mannheim, Germany Stelios Piperidis, ILSP/Athena RC, Greece Tamás Váradi, Hungarian Academy of Sciences, Hungary Tanja Wissik, Austrian Academy of Sciences, Austria Tomaž Erjavec, Jožef Stefan Institute
Identify, Describe and Share your LRs!
Describing your LRs in the LRE Map is now standard practice in the submission procedure of LREC (introduced in 2010 and adopted by other conferences). To continue the efforts initiated at LREC 2014 about “Sharing LRs” (data, tools, web-services, etc.), authors will have the possibility, when submitting a paper, to upload LRs in a special LREC repository. This effort of sharing LRs, linked to the LRE Map for their description, may become a new “regular” feature for conferences in our field, thus contributing to creating a common repository where everyone can deposit and share data.
As scientific work requires accurate citations of referenced work so as to allow the community to understand the whole context and also replicate the experiments conducted by other researchers, LREC 2020 endorses the need to uniquely Identify LRs through the use of the International Standard Language Resource Number (ISLRN, www.islrn.org<http://www.islrn.org>), a Persistent Unique Identifier to be assigned to each Language Resource. The assignment of ISLRNs to LRs cited in LREC papers will be offered at submission time.
Univerza v Ljubljani Filozofska fakulteta Assoc. Prof. dr. Darja Fišer Oddelek za prevajalstvo / Department of translation Filozofska fakulteta / Faculty of arts Aškerčeva cesta 2, SI-1000 Ljubljana, Slovenija / Slovenia darja.fiser at ff.uni-lj.si<mailto:darja.fiser at ff.uni-lj.si>, www.ff.uni-lj.si<http://www.ff.uni-lj.si>
<http://www.uni-lj.si> [cid:3E2D45B8-8A16-4FEF-88B1-495AB2C90733 at ff.uni-lj.si]
-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 14666 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20191212/c895137c/attachment.txt> -------------- next part -------------- A non-text attachment was scrubbed... Name: logo_100.png Type: image/png Size: 8126 bytes Desc: logo_100.png URL: <https://mailman.uib.no/public/corpora/attachments/20191212/c895137c/attachment.png>