[Corpora-List] COVID-19 Open Research Dataset (CORD-19) available as open corpus in Sketch Engine

Miloš Jakubíček milos.jakubicek at sketchengine.co.uk
Thu Mar 26 18:03:17 CET 2020


Dear all,

as you might have already noticed, the Allen Institute for AI has released an open dataset comprising 29,315 research papers on the Covid-19 topic.

To help people analyse the texts and to make access easier, we have processed the dataset into a searchable corpus in Sketch Engine.

*The corpus is in the "open" category: no account is required to get access:*

*https://app.sketchengine.eu/#dashboard?corpname=preloaded%2Fcovid19*

or search through:

*https://app.sketchengine.eu/#open*

Some functions, such as building user subcorpora, require an account. Please create a trial account and email your username to inquiries at sketchengine.eu with the subject "Covid 19 corpus", and we will give you a free account to access this corpus. EU researchers will typically have free access through the ELEXIS infrastructure <https://elex.is>.

More information, including a download of the tokenized, part-of-speech tagged and lemmatized vertical text, is available at:

*https://www.sketchengine.eu/covid19/*
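
For those who prefer to work with the downloaded vertical text offline, here is a minimal sketch of how such a file might be read in Python. A vertical file has one token per line with tab-separated attributes, and structures such as documents and sentences appear as XML-like tags on their own lines. The column order (word, tag, lemma), the <s> sentence tags, the function name read_vertical and the sample data below are assumptions for illustration only; please check the actual layout of the download before relying on this.

```python
from collections import Counter
from typing import Iterable, Iterator, List, Tuple

# Assumed column order: word form, part-of-speech tag, lemma.
Token = Tuple[str, str, str]


def read_vertical(lines: Iterable[str]) -> Iterator[List[Token]]:
    """Yield one list of (word, tag, lemma) tuples per <s>...</s> sentence."""
    sentence: List[Token] = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line:
            continue
        # Structure lines (<doc ...>, <s>, </s>, ...) carry no tab-separated columns.
        if line.startswith("<") and "\t" not in line:
            if line.startswith("</s>") and sentence:
                yield sentence
                sentence = []
            continue
        cols = line.split("\t")
        if len(cols) >= 3:
            sentence.append((cols[0], cols[1], cols[2]))
    # Flush tokens left over if the final sentence was never closed.
    if sentence:
        yield sentence


# Made-up sample in the assumed layout (not taken from the real corpus).
sample = """<doc id="example">
<s>
Viruses\tNNS\tvirus
spread\tVBP\tspread
.\t.\t.
</s>
<s>
The\tDT\tthe
virus\tNN\tvirus
mutates\tVBZ\tmutate
.\t.\t.
</s>
</doc>
""".splitlines()

sentences = list(read_vertical(sample))
lemma_counts = Counter(lemma for sent in sentences for _, _, lemma in sent)
print(len(sentences), lemma_counts.most_common(3))
```

To process the real download, an open file handle can be passed to read_vertical in place of the sample lines, so the text is read line by line rather than loaded into memory at once.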

All the best & hang in there, everybody, wherever you are,
Milos Jakubicek

CEO, Lexical Computing
Brno, CZ | Brighton, UK
http://www.lexicalcomputing.com
http://www.sketchengine.co.uk


