[Corpora-List] corpora of grammatical errors

Ng Hwee Tou nght at nus.edu.sg
Tue Apr 24 01:15:41 CEST 2012

Hi all,

The NUS Corpus of Learner English (NUCLE) consists of about 1,400 essays written by university students at the National University of Singapore. It contains over one million words, all fully annotated with error tags and corrections. All annotations were made by professional English instructors. The corpus is distributed under the standard NUS licensing agreement and can be downloaded from the NUS Enterprise R2M portal:


The corpus was first reported in the following paper:

Dahlmeier, Daniel, & Ng, Hwee Tou (2011). Grammatical Error Correction with Alternating Structure Optimization. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011), pp. 915-923. Portland, Oregon, USA.

Hwee Tou Ng

From: corpora-bounces at uib.no [mailto:corpora-bounces at uib.no] On Behalf Of Anabela Barreiro
Sent: Saturday, 14 April, 2012 6:25 PM
To: corpora at uib.no
Subject: [Corpora-List] corpora of grammatical errors

Dear Corpora List Members,

I am looking for public corpora containing sentences with grammatical errors.

I plan to use the corpora as input to grammar checking and correction routines.

The corpora can be in English or Romance languages.

I would appreciate any pointers to where I can find such corpora.

Thank you!

Anabela M. Barreiro
Personal webpage: https://www.l2f.inesc-id.pt/wiki/index.php/Anabela_Barreiro
LinkedIn: http://www.linkedin.com/in/anabelabarreiro <http://www.linkedin.com/pub/3/219/A43>
