[Corpora-List] Query about corpora of spoken English
Dr Wendy Anderson
W.Anderson at englang.arts.gla.ac.uk
Mon Dec 5 11:16:03 CET 2005
I had sent a similar message to Nicolas Ballier, but since there seems to be
some interest in aligning speech files and transcriptions I thought I should
post to the list too.
The SCOTS corpus, at the University of Glasgow, contains texts in spoken
Scottish English, as well as written texts. Our website is at:
www.scottishcorpus.ac.uk. At present we have about 50 spoken documents (and
growing), which range from Scottish Standard English to dialects of the Scots
tongue. The corpus is freely available and can be searched online (or files
downloaded if you prefer).
The sound (or video) files are aligned with an orthographic transcription,
which enables the user to click on a word and go directly to that utterance
in the sound file, or, vice versa, go to a point in the audio file and
scroll directly to the equivalent part of the transcription. We use Praat to
make time-aligned transcriptions.
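
For anyone curious how such a two-way link can work in principle, here is a
minimal Python sketch. It is not the SCOTS implementation: it simply assumes
the time-aligned transcription has already been exported as a sorted list of
(start, end, text) intervals in seconds. The names Interval, seek_time_for and
interval_at, and the sample data, are invented for illustration.

```python
from bisect import bisect_right
from typing import List, NamedTuple, Optional


class Interval(NamedTuple):
    """One time-aligned stretch of the transcription (hypothetical layout)."""
    start: float  # seconds from the beginning of the recording
    end: float    # seconds; the interval covers start <= t < end
    text: str     # orthographic transcription of this stretch


# Invented sample data, sorted by start time and non-overlapping.
INTERVALS: List[Interval] = [
    Interval(0.0, 2.5, "first utterance"),
    Interval(2.5, 4.0, ""),                 # a pause
    Interval(4.0, 7.2, "second utterance"),
]


def seek_time_for(intervals: List[Interval], index: int) -> float:
    """Text -> audio: the playback position for a clicked interval."""
    return intervals[index].start


def interval_at(intervals: List[Interval], t: float) -> Optional[int]:
    """Audio -> text: index of the interval containing time t, or None."""
    starts = [iv.start for iv in intervals]
    i = bisect_right(starts, t) - 1  # last interval starting at or before t
    if i >= 0 and t < intervals[i].end:
        return i
    return None
```

Clicking a word would call seek_time_for with the index of the interval that
contains it; moving the audio position would call interval_at and scroll the
transcription to the interval returned. The binary search keeps the second
lookup fast even for long recordings.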
Please do get in touch if you would like to know more. We gave a paper on
this subject at this summer's Corpus Linguistics 2005 - it doesn't seem to
be available online yet, but I can send a copy to anyone who is interested.
Dr Wendy J. Anderson
Scottish Corpus of Texts and Speech
Department of English Language
University of Glasgow
12 University Gardens
----- Original Message -----
From: "joshua raclaw" <Joshua.Raclaw at colorado.edu>
To: <R.M.Salkie at bton.ac.uk>
Cc: <CORPORA at UIB.NO>
Sent: Friday, December 02, 2005 4:02 PM
Subject: Re: [Corpora-List] Query about corpora of spoken English
> I'm not currently aware of any collection of spoken English corpora like
> if you could, please send any responses to the list and to Nicolas.