[Corpora-List] ELRA - Language Resources Catalogue - Update

ELDA info at elda.org
Tue Mar 20 12:46:07 CET 2007


Our apologies if you have received multiple copies of this announcement.

*******************************************************************
ELRA - Language Resources Catalogue - Update
*******************************************************************

ELRA is happy to announce that 3 new Speech Related Resources are now
available in its catalogue.
Moreover, we are pleased to announce that years 2005 and 2006 from the
Text Corpus of "Le Monde" (ELRA-W0015)* *are now available.

*ELRA-S0235 LC-STAR Hebrew (Israel) phonetic lexicon
*The LC-STAR Hebrew (Israel) phonetic lexicon comprises 109,580 words,
including a set of 62,431 common words, a set of 47,149 proper names
(including person names, family names, cities, streets, companies and
brand names) and a list of 8,677 special application words. The lexicon
is provided in XML format and includes phonetic transcriptions in SAMPA.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=984&language=en
<http://catalog.elra.info/product_info.php?products_id=13&language=en>

*ELRA-S0236 LC-STAR English-Hebrew (Israel) Bilingual Aligned Phrasal
lexicon
*The LC-STAR English-Hebrew (Israel) Bilingual Aligned Phrasal lexicon
comprises 10,520 phrases from the tourist domain. It is based on a list
of short sentences obtained by translation from US-English 10,449
phrasal corpus. The lexicon is provided in XML format.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=985&language=en
<http://catalog.elra.info/product_info.php?products_id=982&language=en>

*ELRA-S0237 LC-STAR US English phonetic lexicon
*The LC-STAR US English phonetic lexicon comprises 102,310 words,
including a set of 51,119 common words, a set of 51,111 proper names
(including person names, family names, cities, streets, companies and
brand names) and a list of 6,807 special application words. The lexicon
is provided in XML format and includes phonetic transcriptions in SAMPA.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=986&language=en
<http://catalog.elra.info/product_info.php?products_id=983&language=en>

*ELRA-W0015 Text corpus of "Le Monde"
*Corpus from "Le Monde" newspaper. Years 1987 to 2002 are available in
an ASCII text format. Years 2003 to 2006 are available in .XML format.
Each month consists of some 10 MB of data (circa 120 MB per year).
For more information, see:
http://catalog.elra.info/product_info.php?products_id=438&language=en
<http://catalog.elra.info/product_info.php?products_id=438&language=en>


For more information on the catalogue, please contact Valérie Mapelli
mailto:mapelli at elda.org

Our on-line catalogue has moved to the following address:
http://catalog.elra.info. Please update your bookmarks.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: https://mailman.uib.no/public/corpora-archive/attachments/20070320/92de9e14/attachment.html


More information about the Corpora-archive mailing list