We have released a whole series of multilingual corpora, each containing a large number of Spanish text. These are the JRC-Acquis <http://ipsc.jrc.ec.europa.eu/?id=198> , DGT-Acquis <http://ipsc.jrc.ec.europa.eu/?id=783> , DGT Translation Memory <http://ipsc.jrc.ec.europa.eu/?id=197> , ECDC-Translation Memory <http://ipsc.jrc.ec.europa.eu/index.php?id=782> and the texts provided with the JRC EuroVoc Indexer JEX <http://ipsc.jrc.ec.europa.eu/?id=60> .
You can download the corpora from the JRC's Language Technology Resources page:
http://ipsc.jrc.ec.europa.eu/index.php?id=61
All the best,
Ralf
Ralf Steinberger <http://langtech.jrc.ec.europa.eu/RS.html> European Commission - Joint Research Centre (JRC) URL - Applications: http://emm.newsbrief.eu/overview.html URL - The science behind them: <http://langtech.jrc.it/> http://langtech.jrc.ec.europa.eu
From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of parisa berangi Sent: 03 December 2012 17:14 To: corpora at uib.no Subject: [Corpora-List] question about spanish corpus
Hello Corpora members, i'm looking for a large corpus in spanish language. the largest monolingual spanish corpus that i've found is wikiCorpus (www.lsi.upc.edu/~nlp/wikicorpus/) but i need a much larger one. does anyone know of where I can get a freely available large Spanish
corpus ? Thank you so much.
-- Parisa Berangi M.Sc. Student, Natural Language Processing Lab, School of Electrical and Computer Engineering University Of Tehran Tehran ,Iran
-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 27487 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20121203/324ada14/attachment.txt>