I am pleased to announce a new linguistic resource developed at Centro Ramón Piñeiro para a Investigación en Humanidades (http://www.cirp.es).
It is a tagged corpus for Galician language revised by hand which include more than 300.000 gramatical elements extracted from texts of newspapers and journals. So, it is suitable to be used to train different statistical linguistic tools.
You can find more information and a link to download it at:
(Descargas/Corpus de adestramento section).
It is released under the LGPLLR license (see COPYING file of the package for details).
-- Fco. Mario Barcala Rodríguez Computing manager of CORGA project