I'm looking for a chunk-annotated version of the Penn Treebank. It seems to be the most popular resource for training and testing chunking software, but I haven't been able to find either a chunked version or an algorithm for extracting chunks deterministically. Is there a standard resource that everybody uses, or does everybody just extract the chunks from the parsed data themselves?
Best, Aleksandar Savkov