[Corpora-List] Corpora and coronavirus

Mark Davies Mark_Davies at byu.edu
Fri Mar 27 00:43:10 CET 2020


Following up on Milos Jakubicek's comments about corpora and COVID-19, some may also be interested in using the NOW Corpus to investigate what is being said about the coronavirus, social distancing, sheltering in place, flattening the curve, herd immunity, etc.

https://www.english-corpora.org/now/ (click on the "coronavirus" link for many sample searches)

The NOW corpus is continually updated (6-12 million words each day; 200-250 million words each month; almost 10 billion words overall), which makes it a very useful corpus for looking at recent topics. As of March 25, NOW has 155,605 articles that mention coronavirus or COVID-19 -- and 44,755 articles with either term in just the last 7 days.

NOW allows users to track the frequency of a word or phrase over time since 2010 (including in 10-day increments), to limit and compare by time period and country, to see collocates of any word or phrase, and to quickly and easily create "Virtual Corpora" based on metadata and/or words or phrases in the texts.

Most of the data from NOW (95% of the original text, or about 9.5 billion words) is also available for download and offline use from:

https://www.corpusdata.org/

Best,

Mark Davies

============================================
Mark Davies
Professor of Linguistics / Brigham Young University
http://davies-linguistics.byu.edu/

** Corpus design and use // Linguistic databases **
** Historical linguistics // Language variation **
** English, Spanish, and Portuguese **
============================================


