[Corpora-List] ELRA - Language Resources Catalogue - Update

ELDA info at elda.org
Tue Jun 20 13:08:01 CEST 2006


Our apologies if you have received multiple copies of this announcement

*******************************************************************
ELRA - Language Resources Catalogue - Update
*******************************************************************
We are happy to announce that new Text and Speech Language Resource are
now available in our catalogue.
To view all the Language Resources available, you can visit our on-line
catalogue : http://catalog.elda.org/index.php?language=en

L0067
<http://catalog.elda.org:8080/product_info.php?products_id=867&osCsid=0a57b78fd3504ecf1c75825782d061de>English
lexicon with morphological information
<http://catalog.elda.org:8080/product_info.php?products_id=867&osCsid=0a57b78fd3504ecf1c75825782d061de>This
English lexicon is made up of 174,000 inflected forms corresponding to
68,000 simple word lemmas (including 31,900 nouns, 11,800 verbs, 19,900
adjectives, 4,100 adverbs, 300 pronouns, articles,
prepositions/postpositions and conjunctions). Each line in the resource
file shows an inflected form, its part of speech, its related lemma and
its morphological information

<http://catalog.elda.org:8080/product_info.php?products_id=867&osCsid=0a57b78fd3504ecf1c75825782d061de>
L0068
<http://catalog.elda.org:8080/product_info.php?products_id=868&osCsid=0a57b78fd3504ecf1c75825782d061de>
French lexicon with morphological information
<http://catalog.elda.org:8080/product_info.php?products_id=868&osCsid=0a57b78fd3504ecf1c75825782d061de>
This French lexicon is made up of 424,000 inflected forms corresponding
to 55,000 simple word lemmas (including 34,400 nouns, 7,300 verbs,
11,700 adjectives, 1,400 adverbs, 200 pronouns, articles,
prepositions/postpositions and conjunctions). Each line in the resource
file shows an inflected form, its part of speech, its related lemma and
its morphological information.

<http://catalog.elda.org:8080/product_info.php?products_id=868&osCsid=0a57b78fd3504ecf1c75825782d061de>
L0069
<http://catalog.elda.org:8080/product_info.php?products_id=869&osCsid=0a57b78fd3504ecf1c75825782d061de>
Italian lexicon with morphological information
<http://catalog.elda.org:8080/product_info.php?products_id=869&osCsid=0a57b78fd3504ecf1c75825782d061de>
This Italian lexicon is made up of 862,500 inflected forms corresponding
to 112,000 simple word lemmas (including 66,340 nouns, 12,030 verbs,
28,080 adjectives, 4,890 adverbs, 660 pronouns, articles,
prepositions/postpositions and conjunctions). Each line in the resource
file shows an inflected form, its part of speech, its related lemma and
its morphological information.

<http://catalog.elda.org:8080/product_info.php?products_id=869&osCsid=0a57b78fd3504ecf1c75825782d061de>
L0070
<http://catalog.elda.org:8080/product_info.php?products_id=870&osCsid=0a57b78fd3504ecf1c75825782d061de>
Italian lexicon with morphological information and clitic verbs
<http://catalog.elda.org:8080/product_info.php?products_id=870&osCsid=0a57b78fd3504ecf1c75825782d061de>
This Italian lexicon is the same as the one described in ELRA-L0069, but
with the addition of clitic verbs, which increases the number of
inflected forms to 1,800,000 (still corresponding to 112,000 simple
words lemmas). It contains 66,340 nouns, 12,030 verbs, 28,080
adjectives, 4,890 adverbs, 660 pronouns, articles,
prepositions/postpositions and conjunctions. Each line in the resource
file shows an inflected form, its part of speech, its related lemma and
its morphological information.

<http://catalog.elda.org:8080/product_info.php?products_id=870&osCsid=0a57b78fd3504ecf1c75825782d061de>
L0071
<http://catalog.elda.org:8080/product_info.php?products_id=871&osCsid=0a57b78fd3504ecf1c75825782d061de>
Spanish lexicon with morphological information
<http://catalog.elda.org:8080/product_info.php?products_id=871&osCsid=0a57b78fd3504ecf1c75825782d061de>
This Spanish lexicon is made up of 816,000 inflected forms corresponding
to 104,000 simple word lemmas (including 52,000 nouns, 9,800 verbs,
21,200 adjectives, 20,500 adverbs, 500 pronouns, articles,
prepositions/postpositions and conjunctions). Each line in the resource
file shows an inflected form, its part of speech, its related lemma and
its morphological information.

<http://catalog.elda.org:8080/product_info.php?products_id=871&osCsid=0a57b78fd3504ecf1c75825782d061de>
S0217
<http://catalog.elda.org:8080/product_info.php?products_id=866&osCsid=0a57b78fd3504ecf1c75825782d061de>
BITS Logatome Synthesis Corpus – BITS-LG
<http://catalog.elda.org:8080/product_info.php?products_id=866&osCsid=0a57b78fd3504ecf1c75825782d061de>
This corpus contains 11,036 recordings of logatomes spoken by 4
professional German speakers covering all German diphone combinations as
well as the most prominent combination German - French - English. Each
logatome was recorded in three channels: close microphone, large
membrane microphone and laryngographic signal. All diphones are
segmented and labelled into phonemic units.

For more information on the catalogue, please contact Valérie Mapelli
mailto:mapelli at elda.org


<http://catalog.elda.org:8080/product_info.php?products_id=866&osCsid=0a57b78fd3504ecf1c75825782d061de>







More information about the Corpora-archive mailing list