Movie Box Office Prediction With Self-Supervised and Visually Grounded Pretraining
Chao, Qin, Kim, Eunsoo, Li, Boyang
–arXiv.org Artificial Intelligence
Investments in movie production are associated with a high level of risk as movie revenues have long-tailed and bimodal distributions. Accurate prediction of box-office revenue may mitigate the uncertainty and encourage investment. However, learning effective representations for actors, directors, and user-generated content-related keywords remains a challenging open problem. In this work, we investigate the effects of self-supervised pretraining and propose visual grounding of content keywords in objects from movie posters as a pertaining objective. Experiments on a large dataset of 35,794 movies demonstrate significant benefits of self-supervised training and visual grounding. In particular, visual grounding pretraining substantially improves learning on movies with content keywords and achieves 14.5% relative performance gains compared to a finetuned BERT model with identical architecture.
arXiv.org Artificial Intelligence
Apr-20-2023
- Country:
- Asia > Singapore (0.05)
- Europe > France
- Île-de-France > Paris > Paris (0.04)
- North America > United States
- California > Los Angeles County > Los Angeles (0.14)
- Genre:
- Research Report (0.50)
- Industry:
- Leisure & Entertainment (1.00)
- Media > Film (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks (0.93)
- Natural Language (1.00)
- Vision (1.00)
- Information Technology > Artificial Intelligence