[Corpora-List] Legal-domain corpora

Stella Neumann st.neumann at mx.uni-saarland.de
Wed Oct 18 18:29:00 CEST 2006


Seth,
check out the HOLJ Corpus built in the framework of the SUM project in
Edinburgh (http://www.ltg.ed.ac.uk/SUM/index.html).
It contains court decisions by the House of Lords, is annotated and can
be downloaded for free.
Best,
Stella

Seth Grimes schrieb:

> Hello all,

>

> I'm researching legal-domain application of NLP with machine

> learning. What annotated corpora are available in this domain, either for

> free or for a license fee? I'd be interested in --

>

> - legislation and statutes

> - case law

> - briefs, depositions & testimony, crime reports, and evidentiary

> materials

> - court judgments

> - patent filings

>

> -- and also in parallel, multi-lingual corpora, for instance that might

> have been created in the EU, Switzerland, Canada, and other areas with

> multiple official languages.

>

> I've been told that news-media text can provide good training

> material for the legal domain. I'd also be interested in hearing

> reactions to that claim, especially if anyone has formally studied the

> question.

>

> Thanks very much for all help,

>

> Seth

>

>

> --

> Seth Grimes Alta Plana Corp, analytical computing & data management

> Intelligent Enterprise magazine (CMP), Contributing Editor

> grimes at altaplana.com http://altaplana.com 301-270-0795

>


--
Dr. Stella Neumann
Englische Sprach- und
Übersetzungswissenschaft

Universität des Saarlandes
Fachrichtung 4.6
Angewandte Sprachwissenschaft
sowie Übersetzen und Dolmetschen
Postfach 15 11 50
D-66041 Saarbrücken

Tel.: +49(681) 302 64307
Fax: +49(681) 302 64375
e-mail: st.neumann at mx.uni-saarland.de

http://fr46.uni-saarland.de/steiner.php
http://www.uni-saarland.de/~st.neumann





More information about the Corpora-archive mailing list