[Corpora-List] German political speeches corpus [update]

Adrien Barbaresi barbaresi at bbaw.de
Mon Jun 17 13:12:35 CEST 2019

Dear corpora-list readers,

I would like to inform you about a significant update of the German political speeches corpus, which now features more sources, spanning a time from 1984 to 2017. The corpus currently includes 6,685 speeches by 71 speakers, amounting to about 13 million tokens.

It is released under Creative Commons Attribution-ShareAlike license (CC BY-SA) and can be downloaded here as an archive in XML format: http://purl.org/corpus/german-speeches

It can also be queried online using a full-text search including linguistic annotation: https://www.dwds.de/r?corpus=politische_reden

Best regards, Adrien Barbaresi

More information about the Corpora mailing list