[Corpora-List] Cornell Movie-Quotes Corpus
Cristian Danescu-Niculescu-Mizil
cristiand at cs.stanford.edu
Tue Aug 14 17:40:28 CEST 2012
Announcing the availability of the Cornell Movie-Quotes Corpus, a large collection of movie scripts with memorability annotations. The data includes about 900,000 movie script lines from over 1,000 movies. Out of these, 6,282 lines are matched to IMDb memorable quotes. This corpus is released together with the paper:
"You had me at hello: How phrasing affects memorability"
Cristian Danescu-Niculescu-Mizil, Justin Cheng, Jon Kleinberg and Lillian Lee
ACL 2012
The download site is:
http://www.cs.cornell.edu/~cristian/memorability.html
(This corpus is complementary to the Cornell Movie-Dialogs Corpus released this July.)
Cristian Danescu-Niculescu-Mizil, Justin Cheng, Jon Kleinberg and Lillian Lee
More information about the Corpora
mailing list