[Corpora-List] Final CfP (extended deadline): Measuring Language Complexity

Aleksandrs Berdicevskis aleksandrs.berdicevskis at lingfil.uu.se
Wed Feb 7 15:09:56 CET 2018


[Apologies for multiple postings]

Measuring Language Complexity Date and location: April 15, 2018, Torun, Poland (satellite event of Evolang XII, https://evolang.cles.umk.pl/) Call deadline (extended): 28-Feb-2018 Website: http://www.christianbentz.de/MLC_index.html Contact: aleksandrs.berdicevskis at lingfil.uu.se<mailto:aleksandrs.berdicevskis at lingfil.uu.se>

Third call for papers:

An influential line of thinking within evolutionary linguistics is that languages change in response to socioecological pressures, i.e. adapt to their environmental niches. Language complexity is a common parameter to test for such adaptation. It is, however, notoriously difficult to define and measure. Virtually every study of complexity uses its own operationalization and measure. On one hand, this diversity is beneficial for the field, since an intricate phenomenon is being studied from different angles. On the other hand, the comparison of different studies is inhibited. This is particularly problematic if different measures yield different conclusions, since there currently is little consensus about how measures themselves can be evaluated and compared.

To overcome this, we organize a shared task (shared tasks are widely used in computational linguistics) on linguistic complexity, namely: Measure and compare the complexities of a set of 37 language varieties of 7 families (http://www.christianbentz.de/MLC_data.html, submissions covering only a part of the sample may also be considered). The participants are free to choose whether they want to measure just one facet of complexity (e.g. phoneme/grapheme inventory, morphology, word order), or try to develop an overall complexity measure. The complexity measure can be based on any conceivable metric. The submissions, however, have to clearly state: 1) what exactly is being measured (e.g. overspecification, lexical diversity, irregularity, verbosity, opacity etc.); 2) how the measure is calculated, and the theoretical rationale behind the method; 3) the resulting value for each language.

To facilitate the comparability of different measures, we request that the participants who apply corpus-based measures use the corpora available via the Universal Dependencies project, v2.1 (http://www.christianbentz.de/MLC_data.html). Participants are free to decide which level of annotation they want to use. Plain-text files are also available for those who do not need any annotation. Participants who do not need corpora are exempt from this requirement. We also require that the participants submit all relevant calculations and scripts as supplementary materials (after acceptance).

For further details, see http://www.christianbentz.de/MLC_CFP.html

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 8271 bytes Desc: not available URL: <https://www.uib.no/mailman/public/corpora/attachments/20180207/8f060d5a/attachment.txt>



More information about the Corpora mailing list