You might consider the 2013 MultiLing single document summarization dataset <http://goo.gl/LsGVYE>, derived from featured Wikipedia articles.
Also, you should be able to get the DUC 2002 datasets from NIST <http://www-nlpir.nist.gov/projects/duc/data/2002_data.html>.
-- Jeff Kubina 410-988-4436
On Wed, Dec 10, 2014 at 3:24 PM, Tomáš Kočiský <tomas at kocisky.eu> wrote:
> Hi All,
> Could anyone provide me with pointers to datasets for *evaluating
> (single) document summarization* (extractive and/or abstractive) for
> research purposes? I was unable to obtain the DUC datasets.
> Alternatively, if you have any of the DUC datasets please contact me!
> Many thanks,
> Tomas Kocisky
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 1787 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20141210/aaccd65b/attachment.txt>