[Corpora-List] cost of treebanking

Jona Schuman j_schuma at cs.concordia.ca
Thu Aug 16 22:23:01 CEST 2012


Would anyone happen to know of any estimates of the number of man-hours that went into annotating the Penn Treebank (or alternatively, the Genia Treebank)? There are some rough figures in Marcus et al. (1993), "Building a Large Annotated Corpus of English: The Penn Treebank", but these seem to be hypothetical projections, applicable only to the earlier, preliminary version of the PTB.

Thanks, Jona
