[Corpora-List] corpora of grammatical errors

Anabela Barreiro barreiro_anabela at hotmail.com
Mon Apr 16 12:37:21 CEST 2012


This is perfect, Cerstin! This is exactly the kind of corpora I need. Why don't you join Hoo (read my previous e-mail)? :) Thank you very much!

Anabela.> Date: Mon, 16 Apr 2012 12:17:06 +0200
> From: cerstin.mahlow at unibas.ch
> To: corpora at uib.no
> Subject: Re: [Corpora-List] corpora of grammatical errors
>
>
> Dear Anabela,
>
> Zitat von Anabela Barreiro:
>
> > I am looking for public corpora containing sentences with grammatical errors.
> >
> > I plan to use the corpora as input to grammar checking and
> > correction routines.
> >
> > The corpora can be in English or romance languages. I appreciate any
> > indication of where I can find those corpora. Thank you!
>
> It's not exactly the language you are looking for, but:
>
> For my dissertation I collected more than 200 ungrammatical sentences
> in German from published sources (newspapers, books, advertisements,
> letters, etc.). Ungrammatical here means:
>
> - errors concerning agreement
> - wrong word order
> - duplicate or missing words
>
> We are about to release this resource as an annotated corpus -- at the
> moment I could sent you the sentences together with a comment each
> (concerning the error and a potential correct version of this sentence).
>
> Best regards
>
> Cerstin
>
> --
> Dr. phil. Cerstin Mahlow
>
> Universitšt Basel
> Departement Sprach- und Literaturwissenschaften
> Fachbereich Deutsche Sprach- und Literaturwissenschaft
> Nadelberg 4
> 4051 Basel
> Schweiz
>
> Tel: +41 61 267 07 65
> Fax: +41 61 267 34 40
> Mail: cerstin.mahlow at unibas.ch
> Web: http://www.oldphras.net
>
> ----------------------------------------------------------------
> This message was sent using IMP, the Internet Messaging Program.
>
>
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 2645 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20120416/79232440/attachment.txt>



More information about the Corpora mailing list