[Corpora-List] Query about corpora of spoken English

Dr Wendy Anderson W.Anderson at englang.arts.gla.ac.uk
Mon Dec 5 11:16:03 CET 2005

Dear all,
I sent a similar message to Nicolas Ballier, but since there seems to be
some interest in aligning speech files and transcriptions I thought I should
post to the list too.
The SCOTS corpus, at the University of Glasgow, contains texts in spoken
Scottish English, as well as written texts. Our website is at:
www.scottishcorpus.ac.uk. At present we have about 50 spoken documents (and
growing); these range from Scottish Standard English to dialects of the
Scots tongue. The corpus is freely
available and can be searched online (or files downloaded if you prefer).
The sound (or video) files are aligned with an orthographic transcription,
which enables the user to click on a word and go directly to that utterance
in the sound file, or, vice versa, go to a point in the audio file and
scroll directly to the equivalent part of the transcription. We use Praat to
make time-aligned transcriptions.
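
For anyone curious how this kind of two-way lookup can work in principle, here is a minimal sketch in Python. It is not the SCOTS implementation: it simply assumes that each transcribed stretch of speech is stored as an interval with a start time, an end time and its text (roughly the shape of an interval tier in a Praat TextGrid). The names Interval, INTERVALS, time_for_word and interval_at, and the sample utterances, are all made up for illustration.

```python
import re
from bisect import bisect_right
from typing import List, NamedTuple, Optional


class Interval(NamedTuple):
    """One time-aligned stretch of transcription (hypothetical structure)."""
    start: float  # seconds from the beginning of the recording
    end: float    # seconds from the beginning of the recording
    text: str     # orthographic transcription; empty string for a pause


# Invented sample data, sorted by start time and non-overlapping.
INTERVALS: List[Interval] = [
    Interval(0.0, 2.5, "aye, it was a braw day"),
    Interval(2.5, 4.0, ""),
    Interval(4.0, 7.2, "we went doon tae the shore"),
]


def time_for_word(intervals: List[Interval], word: str) -> Optional[float]:
    """Text -> audio: start time of the first utterance containing the word."""
    target = word.lower()
    for iv in intervals:
        # Tokenise crudely on letters, digits and apostrophes.
        if target in re.findall(r"[\w']+", iv.text.lower()):
            return iv.start
    return None


def interval_at(intervals: List[Interval], t: float) -> Optional[Interval]:
    """Audio -> text: the interval that covers time t, if any."""
    starts = [iv.start for iv in intervals]
    i = bisect_right(starts, t) - 1  # last interval starting at or before t
    if i >= 0 and t < intervals[i].end:
        return intervals[i]
    return None
```

With the sample data, time_for_word(INTERVALS, "shore") gives the time to seek to in the sound file, and interval_at(INTERVALS, 5.0) gives the utterance to scroll to in the transcription. Note that the lookup here is per utterance rather than per word, matching the behaviour described above.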
Please do get in touch if you would like to know more. We gave a paper on
this subject at this summer's Corpus Linguistics 2005 - it doesn't seem to
be available online yet, but I can send a copy to anyone who is interested.

Wendy Anderson
Dr Wendy J. Anderson
Research Assistant
Scottish Corpus of Texts and Speech
Department of English Language
University of Glasgow
12 University Gardens
G12 8QQ
Scotland, UK

Website: http://www.scottishcorpus.ac.uk
----- Original Message -----
From: "joshua raclaw" <Joshua.Raclaw at colorado.edu>
To: <R.M.Salkie at bton.ac.uk>
Sent: Friday, December 02, 2005 4:02 PM
Subject: Re: [Corpora-List] Query about corpora of spoken English

> I'm not currently aware of any collection of spoken English corpora like
> that - if you could, please send any responses to the list and to Nicolas.


> Joshua
