[Corpora-List] Spanish dictionary with phonetic transcriptions

Alberto Barron Cedeno albarron at cs.upc.edu
Sat Mar 26 10:45:27 CET 2016


Dear Morten,

The DIMEx100 corpus includes pronunciation annotation and might be interesting. Unfortunately, the corpus website is down, but you might contact the researcher behind it, Luis Pineda:

http://turing.iimas.unam.mx/~luis/ <http://turing.iimas.unam.mx/~luis/>

Best regards Alberto


> On Mar 25, 2016, at 23:32, Gilles SÚrasset <gilles.serasset at imag.fr> wrote:
>
> Dear Morten,
>
> The DBnary dataset (an RDF version of several wiktionaries, see http://kaiko.getalp.org/about-dbnary ) contains around 33K pronunciations of canonical forms.
>
> You may explore the data using the sparql endpoint (http://kaiko.getalp.org/sparql ) before downloading the full dataset for the extraction of pronunciations.
>
> The following SPARQL query will give you the number of pronunciations of canonical forms:
>
> select count(?p) where {
> ?e dcterms:language lexvo:spa ;
> lemon:canonicalForm ?f .
> ?f lexinfo:pronunciation ?p
> }
>
> The following query will extract the first 10K such pronunciations:
>
> select ?w ?p where {
> ?e dcterms:language lexvo:spa ;
> lemon:canonicalForm ?f .
> ?f lexinfo:pronunciation ?p ;
> lemon:writtenRep ?w
> } LIMIT 1000 OFFSET 10
>
> playing with LIMIT and OFFSET will allow to extract all of them (the server limit will only give you 10K such pronunciations).
>
>
> Regards, Gilles,
>
>> On 22 Mar 2016, at 16:56, Morten Christiansen <christiansen at cornell.edu> wrote:
>>
>> Dear all,
>>
>> I am looking for an electronic Spanish dictionary that contains phonetic transcriptions (e.g., comparable to CELEX). I’ve searched around on the web and various corpus related websites but without luck. I would appreciate pointers to any available dictionaries. Thanks.
>>
>> Best,
>> Morten Christiansen
>>
>> ------
>> Morten H. Christiansen, PhD
>> Professor, Department of Psychology, Cornell University, Ithaca, NY 14853
>> Co-Director, Cornell Cognitive Science Program
>> Professor of Child Language, Aarhus University
>> Senior Scientist, Haskins Laboratories
>> Professor, Department of Language and Communication, University of Southern Denmark
>> Office: 228 Uris Hall || Phone: +1 (607) 255-3834 (dept) || Fax: +1 (607) 255-8433
>> Email: christiansen at cornell.edu
>> Web: http://www.psych.cornell.edu/people/faculty/christiansen-profile/
>> Cornell Cognitive Neuroscience Lab: http://cnl.psych.cornell.edu
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>> _______________________________________________
>> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
>> Corpora mailing list
>> Corpora at uib.no
>> http://mailman.uib.no/listinfo/corpora
>
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 5482 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20160326/95a83a46/attachment.txt>



More information about the Corpora mailing list