[Corpora-List] Norwegian Speech Database Release Announcement

Johanne Ostad Johanne.Ostad at nb.no
Tue Sep 29 10:01:59 CEST 2015


Dear Corpora List Members,

«Språkbanken» at The National Library of Norway recently published «NB Tale», a basic acoustic phonetic speech database for Norwegian.

Språkbanken is a language technology resource collection for Norwegian and offers digital language resources for use in research and development of language technology. The resources can be downloaded from Språkbanken's website free of charge. The collection is expanding continuously: http://www.nb.no/sprakbanken/#ticketsfrom?lang=en

«NB Tale» is the first speech database for Norwegian designed to provide speech data for acoustic-phonetic studies and for the development and evaluation of automatic speech recognition systems. The database contains recordings of 377 speakers from 24 different speaker groups, representing both first- and second language speakers of Norwegian. Each speaker has read 20 sentences from a manuscript, and approximately two minutes of spontaneous speech. The manuscript contains both Norwegian Bokmål and Norwegian Nynorsk sentences that are carefully selected from text corpora, aiming to balance a number of phonetic and phonemic features. Recordings are in 48 kHz / 16 bit, done with two microphones (lapel and studio). The database is annotated in X-SAMPA (Extended SAMPA).

The database has been developed by Lingit AS, commissioned by The National Library of Norway.

License is CC-0.

For more information and download: http://www.nb.no/sprakbanken/show?serial=sbr-31&lang=en

Kind regards, Johanne Ostad

Johanne Ostad Avd. Forskning og formidling Nasjonalbiblioteket Tlf: +47 23 27 62 27 www.nb.no<http://www.nb.no/>

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 6164 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20150929/bf08353c/attachment.txt>



More information about the Corpora mailing list