[Corpora-List] Query: Corpora of American and British English that can be compared?

Adam Kilgarriff adam at lexmasterclass.com
Thu Dec 20 11:26:45 CET 2012


Dear Laure,

the straightforward answer is the 'Brown family' corpora - Brown and LOB were compiled with just this kind of analysis in mind: they were both 1961 and more comparable data points are available for 1991 (FROWN and FLOB) and (tho maybe this is British Englsih only) 1931, 1901 and 2006.

You can do the comparisons easily and directly in the Sketch Engine, where the data is already set up (includiung POS-tagged) and the 'Brown family' corpus contains all the above except the 1901 part.

Regards

Adam

On 18 December 2012 09:23, Laure Gardelle <laure.gardelle at ens-lyon.fr>wrote:


> Dear colleagues,
>
> For my research I need to compare one set of agreement patterns in
> American and British English.
> So would anyone know of two corpora (one for American English, the other
> for British English) that would have sufficiently close collection
> procedures for the hits they return to be compared (ie. for possible
> differences in proportion to be considered meaningful)?? Ideally I am
> looking for contemporary English, but if the data are a bit older, it is
> not a problem.
>
> Many thanks in advance for any help with this!
>
> Laure Gardelle
>
> ______________________________**_________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/**corpora<http://mailman.uib.no/options/corpora>
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/**listinfo/corpora<http://mailman.uib.no/listinfo/corpora>
>

-- ======================================== Adam Kilgarriff <http://www.kilgarriff.co.uk/> adam at lexmasterclass.com Director Lexical Computing Ltd<http://www.sketchengine.co.uk/>

Visiting Research Fellow University of Leeds<http://leeds.ac.uk>

*Corpora for all* with the Sketch Engine <http://www.sketchengine.co.uk>

*DANTE: a lexical database for English<http://www.webdante.com>

* ======================================== -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 2966 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20121220/a70dd24a/attachment.txt>



More information about the Corpora mailing list