as you might have already noticed, the Allen Institute for AI has released an open dataset comprising 29,315 research papers on the Covid-19 topic.
To help people analysing the texts and ease the access, we have processed the dataset into a searchable corpus in Sketch Engine.
*The corpus is in the "open" category: no account is required to get access*
:
*https://app.sketchengine.eu/#dashboard?corpname=preloaded%2Fcovid19 <https://app.sketchengine.eu/#dashboard?corpname=preloaded%2Fcovid19>*
or search through:
*https://app.sketchengine.eu/#open* <https://app.sketchengine.eu/#open>
Some functionalities like building user subcorpora require having an account. Please create a trial account and email your username to inquiries at sketchengine.eu with the subject "Covid 19 corpus" and we will give you a free account to access this corpus. EU researchers will typically have free access through the ELEXIS infrastructure <https://elex.is>.
More information, including download of the tokenized, part-of-speech tagged and lemmatized vertical text available at:
*https://www.sketchengine.eu/covid19/ <https://www.sketchengine.eu/covid19/>*
All the best & hang in everybody, wherever you are, Milos Jakubicek
CEO, Lexical Computing Brno, CZ | Brighton, UK http://www.lexicalcomputing.com http://www.sketchengine.co.uk -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 2140 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20200326/7bb621b4/attachment.txt>