[Corpora-List] Seeking corpus for academic domain

Krishnamurthy, Ramesh r.krishnamurthy at aston.ac.uk
Fri Aug 3 11:15:54 CEST 2012

Dear Lushan

Please see


for a critical summary of corpora of academic English.

best wishes


Ramesh Krishnamurthy

Visiting Academic Fellow

Aston University


From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of Lushan Han
Sent: 18 July 2012 20:30
To: corpora at uib.no
Subject: Re: [Corpora-List] Seeking corpus for academic domain

A smaller corpus (e.g. millions of words) would also be very helpful to me. Please let me know if you happen to know of one.



On Wed, Jul 18, 2012 at 11:03 AM, Lushan Han <lushan1 at umbc.edu> wrote:

Dear all,

I am looking for a very large corpus (> 1 billion words) covering the academic domain, mainly describing universities, projects, conferences, papers, authors, etc. I will compute statistics from it, which will be used in building a query system on structured data for the academic domain.

Does anyone know of such a corpus? Any information will be appreciated.


Lushan Han
