On 19th November 2018, the ESRC Centre for Corpus Approaches to Social Science<http://cass.lancs.ac.uk/> (CASS) will be releasing full transcripts of the Spoken British National Corpus 2014 (Spoken BNC2014) for non-commercial research purposes, here<http://corpora.lancs.ac.uk/bnc2014/> This unique resource makes it possible to study current informal British speech and will be of interest to corpus linguists, NLP researchers, lexicographers, educators and anyone interested in spoken language. While the corpus is already available via Lancaster CQPweb<https://cqpweb.lancs.ac.uk/> the release of the transcripts will allow researchers to apply a much wider range of analytical techniques, because the corpus can now be uploaded to any XML-aware software.
The Spoken BNC2014 is a 10-million-word sample of current spoken British English recorded in different parts of the UK. It consists of transcribed and part-of-speech annotated informal conversations. The corpus was created through a collaboration between CASS and Cambridge University Press.
For sociolinguistic explorations of the Spoken BNC2014 and its comparison with BNC1994 - Demographic (the demographic part of the original BNC), you can also use BNClab (http://corpora.lancs.ac.uk/bnclab), a brand new sociolinguistic tool developed at Lancaster University.
Tony McEnery -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 3561 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20181116/e679573a/attachment.txt>