[Corpora-List] Cornell Movie-Quotes Corpus

Cristian Danescu-Niculescu-Mizil cristiand at cs.stanford.edu
Tue Aug 14 17:40:28 CEST 2012


Announcing the availability of the Cornell Movie-Quotes Corpus, a large collection of movie scripts with memorability annotations. The data includes about 900,000 movie script lines from over 1,000 movies. Of these, 6,282 lines are matched to IMDb memorable quotes. The corpus is released together with the paper:

"You had me at hello: How phrasing affects memorability"
Cristian Danescu-Niculescu-Mizil, Justin Cheng, Jon Kleinberg and Lillian Lee
ACL 2012

The download site is: http://www.cs.cornell.edu/~cristian/memorability.html

(This corpus is complementary to the Cornell Movie-Dialogs Corpus released this July.)

Cristian Danescu-Niculescu-Mizil, Justin Cheng, Jon Kleinberg and Lillian Lee


