DocumentCode
864
Title
Assessing Box Office Performance Using Movie Scripts: A Kernel-Based Approach
Author
Eliashberg, Jehoshua ; Hui, Sam K. ; Zhang, Zhongwei Jake
Author_Institution
Wharton Sch., Univ. of Pennsylvania, Philadelphia, PA, USA
Volume
26
Issue
11
fYear
2014
fDate
Nov. 2014
Firstpage
2639
Lastpage
2648
Abstract
We develop a methodology to predict box office performance of a movie at the point of green-lighting, when only its script and estimated production budget are available. We extract three levels of textual features (genre and content, semantics, and bag-of-words) from scripts using screenwriting domain knowledge, human input, and natural language processing techniques. These textual variables define a distance metric across scripts, which is then used as an input for a kernel-based approach to assess box office performance. We show that our proposed methodology predicts box office revenues more accurately (29 percent lower mean squared error (MSE)) compared to benchmark methods.
Keywords
feature extraction; humanities; natural language processing; text analysis; MSE; bag-of-words; benchmark methods; box office performance assessment; distance metric; estimated production budget; green-lighting; human input; kernel-based approach; mean squared error; movie scripts; natural language processing techniques; screenwriting domain knowledge; textual feature extraction; textual variables; Benchmark testing; Educational institutions; Feature extraction; Measurement; Motion pictures; Portfolios; Production; Entertainment industry; green-lighting; kernel approach; movie production; text mining;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2014.2306681
Filename
6746657
Link To Document