[Corpora-List] English corpus for specific domains

liling tan alvations at gmail.com
Thu Nov 13 15:33:41 CET 2014


Dear linguists,

Traditional corpora such as the British National Corpus, the American COCA corpus and the International Corpus of English hold on to the notion of a balanced corpus and include texts from different registers, domains and types.

Web corpora such as Wikipedia corpora, Web-as-Corpus corpora and many others are compiled by crawling or by crowdsourcing texts, and they also end up as some sort of balanced corpora.

Finding corpora for specific domains therefore takes some resourcefulness, and we would appreciate your help in locating them.

Are there corpora specifically for the following domains?

- *Chemical*: the taxonomy rooted on "chemical", examples of terminology concepts are ("ammonium carbonate", "beta hydroxybutyric acid", "butyl rubber");
- *Equipment*: the taxonomy rooted on "equipment", examples of terminology concepts are ("acoustic modem", "parasail", "clock pendulum");
- *Food*: the taxonomy rooted on "food", examples of terminology concepts are ("jacket potato", "lemonade", "bolognese pasta sauce");
- *Science*: the taxonomy rooted on "science", examples of terminology concepts are ("neuropsychiatry", "craniometry", "microelectronics").

Best Regards,
Liling


