[Corpora-List] Distribution of tokens by POS in BNC or COCA

Mark Davies Mark_Davies at byu.edu
Sun Mar 23 02:54:15 CET 2014



>> Is there a table which lists the contents of BNC or COCA by POS - NN, NNP, JJ, VB and their variations?

You can quickly and easily generate the top 4000-5000 word forms for a given PoS (nn2, jjr, etc) via the web interfaces for COCA or the BNC (http://corpus.byu.edu).

Might also look at http://www.wordfrequency.info/100k.asp (a bit pricey, though, unless you need all of that info.)

Best,

Mark Davies

============================================ Mark Davies Professor of Linguistics / Brigham Young University http://davies-linguistics.byu.edu/

** Corpus design and use // Linguistic databases ** ** Historical linguistics // Language variation ** ** English, Spanish, and Portuguese ** ============================================

________________________________________ From: corpora-bounces at uib.no [corpora-bounces at uib.no] on behalf of Khurshid Ahmad [kahmad at scss.tcd.ie] Sent: Saturday, March 22, 2014 12:15 PM To: corpora at uib.no Subject: [Corpora-List] Distribution of tokens by POS in BNC or COCA

Dear All Is there a table which lists the contents of BNC or COCA by POS - NN, NNP, JJ, VB and their variations? Apologies for using the bandwidth for such a simple query.

-- Best wishes

Khurshid Ahmad. Professor of Computer Science School of Computer Science and Statistics Trinity College Dublin 2 IRELAND

Phone: 00353 1 896 8429 (Labs: 00 353 1 8968435) Fax 353 1 677 2204 Webpage: www.cs.tcd.ie/khurshid.ahmad

_______________________________________________ UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora Corpora mailing list Corpora at uib.no http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list