[Corpora-List] Spanish dictionary with phonetic transcriptions

Marc Brysbaert Marc.Brysbaert at UGent.be
Tue Mar 29 16:05:47 CEST 2016


In Espal you find phonetic transcriptions of 250+K words types: http://www.bcbl.eu/databases/espal/

Best, marc

From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of Alberto Barron Cedeno Sent: zaterdag 26 maart 2016 10:45 To: Morten Christiansen; corpora at uib.no Subject: Re: [Corpora-List] Spanish dictionary with phonetic transcriptions

Dear Morten,

The DIMEx100 corpus includes pronunciation annotation and might be interesting. Unfortunately, the corpus website is down, but you might contact the researcher behind it, Luis Pineda:

http://turing.iimas.unam.mx/~luis/

Best regards Alberto

On Mar 25, 2016, at 23:32, Gilles SÚrasset <gilles.serasset at imag.fr<mailto:gilles.serasset at imag.fr>> wrote:

Dear Morten,

The DBnary dataset (an RDF version of several wiktionaries, see http://kaiko.getalp.org/about-dbnary ) contains around 33K pronunciations of canonical forms.

You may explore the data using the sparql endpoint (http://kaiko.getalp.org/sparql ) before downloading the full dataset for the extraction of pronunciations.

The following SPARQL query will give you the number of pronunciations of canonical forms:

select count(?p) where {

?e dcterms:language lexvo:spa ;

lemon:canonicalForm ?f .

?f lexinfo:pronunciation ?p }

The following query will extract the first 10K such pronunciations:

select ?w ?p where {

?e dcterms:language lexvo:spa ;

lemon:canonicalForm ?f .

?f lexinfo:pronunciation ?p ;

lemon:writtenRep ?w } LIMIT 1000 OFFSET 10

playing with LIMIT and OFFSET will allow to extract all of them (the server limit will only give you 10K such pronunciations).

Regards, Gilles,

On 22 Mar 2016, at 16:56, Morten Christiansen <christiansen at cornell.edu<mailto:christiansen at cornell.edu>> wrote:

Dear all,

I am looking for an electronic Spanish dictionary that contains phonetic transcriptions (e.g., comparable to CELEX). I’ve searched around on the web and various corpus related websites but without luck. I would appreciate pointers to any available dictionaries. Thanks.

Best,

Morten Christiansen

------ Morten H. Christiansen, PhD Professor, Department of Psychology, Cornell University, Ithaca, NY 14853 Co-Director, Cornell Cognitive Science Program Professor of Child Language, Aarhus University Senior Scientist, Haskins Laboratories Professor, Department of Language and Communication, University of Southern Denmark Office: 228 Uris Hall || Phone: +1 (607) 255-3834 (dept) || Fax: +1 (607) 255-8433 Email: christiansen at cornell.edu<mailto:christiansen at cornell.edu> Web: http://www.psych.cornell.edu/people/faculty/christiansen-profile/ Cornell Cognitive Neuroscience Lab: http://cnl.psych.cornell.edu

_______________________________________________ UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora Corpora mailing list Corpora at uib.no<mailto:Corpora at uib.no> http://mailman.uib.no/listinfo/corpora

_______________________________________________ UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora Corpora mailing list Corpora at uib.no<mailto:Corpora at uib.no> http://mailman.uib.no/listinfo/corpora

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 8147 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20160329/38bbd668/attachment.txt>



More information about the Corpora mailing list