[Corpora-List] question about spanish corpus

Ralf Steinberger ralf.steinberger at jrc.ec.europa.eu
Mon Dec 3 17:32:32 CET 2012


Dear Parisa,

We have released a whole series of multilingual corpora, each containing a large number of Spanish text. These are the JRC-Acquis <http://ipsc.jrc.ec.europa.eu/?id=198> , DGT-Acquis <http://ipsc.jrc.ec.europa.eu/?id=783> , DGT Translation Memory <http://ipsc.jrc.ec.europa.eu/?id=197> , ECDC-Translation Memory <http://ipsc.jrc.ec.europa.eu/index.php?id=782> and the texts provided with the JRC EuroVoc Indexer JEX <http://ipsc.jrc.ec.europa.eu/?id=60> .

You can download the corpora from the JRC's Language Technology Resources page:

http://ipsc.jrc.ec.europa.eu/index.php?id=61

All the best,

Ralf

Ralf Steinberger <http://langtech.jrc.ec.europa.eu/RS.html> European Commission - Joint Research Centre (JRC) URL - Applications: http://emm.newsbrief.eu/overview.html URL - The science behind them: <http://langtech.jrc.it/> http://langtech.jrc.ec.europa.eu

From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of parisa berangi Sent: 03 December 2012 17:14 To: corpora at uib.no Subject: [Corpora-List] question about spanish corpus

Hello Corpora members, i'm looking for a large corpus in spanish language. the largest monolingual spanish corpus that i've found is wikiCorpus (www.lsi.upc.edu/~nlp/wikicorpus/) but i need a much larger one. does anyone know of where I can get a freely available large Spanish

corpus ? Thank you so much.

-- Parisa Berangi M.Sc. Student, Natural Language Processing Lab, School of Electrical and Computer Engineering University Of Tehran Tehran ,Iran

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 27487 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20121203/324ada14/attachment.txt>



More information about the Corpora mailing list