[Corpora-List] Document summarization evaluation dataset needed

Jeff Kubina jeff.kubina at gmail.com
Thu Dec 11 02:07:10 CET 2014


You might consider the 2013 MultiLing single document summarization dataset <http://goo.gl/LsGVYE>, derived from featured Wikipedia articles.

Also, you should be able to get the DUC 2002 datasets from NIST <http://www-nlpir.nist.gov/projects/duc/data/2002_data.html>.

Cheers, Jeff

-- Jeff Kubina 410-988-4436

On Wed, Dec 10, 2014 at 3:24 PM, Tomáš Kočiský <tomas at kocisky.eu> wrote:

> Hi All,
> Could anyone provide me with pointers to datasets for *evaluating
> (single) document summarization* (extractive and/or abstractive) for
> research purposes? I was unable to obtain the DUC datasets.
> Alternatively, if you have any of the DUC datasets please contact me!
> Many thanks,
> Tomas Kocisky
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 1787 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20141210/aaccd65b/attachment.txt>

More information about the Corpora mailing list