An Overview of Predicting Movie Popularity Using Large Language Models
The paper discusses an innovative approach to addressing the cold-start problem in movie recommendation systems by leveraging Large Language Models (LLMs). This effort is particularly pertinent for large-scale entertainment platforms where content promotion based on popularity forecasts is critical yet challenging due to rapid content churn and evolving user preferences.
The study introduces a methodology to predict the popularity of newly released movies by using LLMs to analyze movie metadata. The approach is proposed as an enhancement over traditional editorial processes and existing machine learning methods, which can struggle with bias and scalability when evaluating new content. The paper reports promising numerical results, with LLM-based predictions surpassing existing baselines at identifying movie hits.
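The paper does not spell out here which metric defines a "hit," but one plausible way such comparisons are scored is a top-k hit rate. The sketch below is an illustrative assumption, not the paper's evaluation code; `hit_rate_at_k` and the toy titles are hypothetical.

```python
def hit_rate_at_k(predicted: list[str], actual_hits: set[str], k: int) -> float:
    """Fraction of the true hit titles recovered in the top-k
    positions of a predicted popularity ranking."""
    if not actual_hits:
        return 0.0
    return len(set(predicted[:k]) & actual_hits) / len(actual_hits)

# Both true hits ("A" and "C") appear in the top-3 predictions.
score = hit_rate_at_k(["A", "B", "C", "D"], actual_hits={"A", "C"}, k=3)
```

A metric like this rewards a model for surfacing eventual hits early in its ranked list, which matches the promotion use case described above.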
Methodology and Dataset
The paper outlines a multi-step methodology:
Dataset Construction: A benchmark dataset was curated from a real-world entertainment platform, encompassing tens of millions of user interactions. Unlike conventional datasets, this set was designed to track the trajectory of movies from release to peak popularity, and it includes comprehensive metadata like genre, synopsis, content ratings, cast, crew, awards, and more.
Baseline Models: Baselines were established using embeddings from BERT and other state-of-the-art models to generate semantic representations of the metadata, with ranking determined by similarity to embeddings of previously popular content.
LLM Evaluation: LLMs, specifically variants of the Llama model, were used to predict movie popularity through listwise ranking. Intensive prompt engineering was employed to optimize LLM performance in generating accurate predictions accompanied by reasoning.
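As a concrete, simplified illustration of the embedding baseline described above, the ranking step might look like the following sketch. The toy 2-d vectors stand in for BERT embeddings of metadata, and `rank_by_similarity` is a hypothetical helper, not code from the paper.

```python
import numpy as np

def rank_by_similarity(candidate_embs: np.ndarray,
                       popular_embs: np.ndarray) -> list[int]:
    """Rank candidate movies by their mean cosine similarity to
    embeddings of previously popular titles; most similar first."""
    # Row-normalize so dot products become cosine similarities.
    c = candidate_embs / np.linalg.norm(candidate_embs, axis=1, keepdims=True)
    p = popular_embs / np.linalg.norm(popular_embs, axis=1, keepdims=True)
    scores = (c @ p.T).mean(axis=1)  # one mean-similarity score per candidate
    return [int(i) for i in np.argsort(-scores)]

# Toy example: candidate 0 matches the popular title's embedding exactly.
candidates = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
popular = np.array([[1.0, 0.0]])
order = rank_by_similarity(candidates, popular)
```

In a real pipeline the embeddings would come from a pretrained encoder applied to the concatenated metadata fields; the ranking logic itself stays this simple.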
Experimental Insights
Key insights from the experiments highlight that LLMs can effectively predict the popularity potential of movies before they gain significant audience traction. Ablation studies evaluated LLM performance on various subsets of the metadata, revealing that richer metadata leads to better predictions, especially at larger model sizes. The paper emphasizes the role of careful prompt engineering in guiding the LLMs to extract and apply contextual understanding from movie metadata.
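The listwise prompting setup could be sketched roughly as below. The prompt wording, metadata fields, and numbered-list reply format are illustrative assumptions; the paper's actual prompts were the product of far more extensive engineering.

```python
def build_listwise_prompt(movies: list[dict]) -> str:
    """Assemble one prompt asking the model to rank all candidates
    at once (listwise), using whatever metadata each movie has."""
    lines = ["Rank the following upcoming movies from most to least likely",
             "to become popular. Reply with a numbered list of titles only,",
             "followed by one sentence of reasoning.", ""]
    for i, m in enumerate(movies, 1):
        fields = ", ".join(f"{k}: {v}" for k, v in m.items() if k != "title")
        lines.append(f"{i}. {m['title']} ({fields})")
    return "\n".join(lines)

def parse_ranking(reply: str, titles: set[str]) -> list[str]:
    """Recover the ranked titles from a numbered-list reply,
    skipping reasoning lines and any hallucinated titles."""
    ranked = []
    for line in reply.splitlines():
        text = line.strip().lstrip("0123456789. )")
        if text in titles and text not in ranked:
            ranked.append(text)
    return ranked

movies = [
    {"title": "Movie A", "genre": "Thriller", "awards": "Oscar-winning cast"},
    {"title": "Movie B", "genre": "Comedy"},
]
prompt = build_listwise_prompt(movies)
reply = "1. Movie B\n2. Movie A\nReasoning: comedies trend well here."
ranking = parse_ranking(reply, {m["title"] for m in movies})
```

A defensive parser like `parse_ranking` matters in practice: listwise outputs vary with prompt and model, which is part of why the paper stresses prompt design.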
Significantly, the study shows that LLM performance is sensitive to both prompt design and model size, indicating the need for adaptive strategies in their application. The results suggest that additional attributes, such as cast awards, substantially enhance model performance, showcasing the models' ability to generalize from the provided information without explicit external data.
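The metadata ablations discussed above amount to re-running the evaluation on nested field subsets. A minimal sketch of generating those subsets follows; the field names come from the dataset description, but the additive ordering is an assumption for illustration.

```python
# Metadata attributes mentioned in the dataset description; the
# order in which fields are added back is an illustrative choice.
FIELDS = ["genre", "synopsis", "content_ratings", "cast", "crew", "awards"]

def cumulative_subsets(fields: list[str]) -> list[list[str]]:
    """Nested prefixes of the field list, so each ablation run
    includes one more metadata attribute than the previous run."""
    return [fields[:i] for i in range(1, len(fields) + 1)]

subsets = cumulative_subsets(FIELDS)
```

Each subset would then be rendered into the prompt, letting one attribute at a time be credited (or not) for any gain in ranking quality.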
Implications and Future Work
The findings have broad practical implications for content recommendation systems: the approach could be integrated into editorial workflows and automated content-promotion pipelines to enhance user engagement and the discoverability of new movies. By offering early insights into content popularity, the methodology can help editorial teams make informed promotion decisions without the conventional reliance on interaction data.
Looking ahead, the paper speculates on the evolving efficiency of LLMs and the potential emergence of recommendation agents, positing that insights derived from this research are adaptable to various domains beyond entertainment. Future developments could focus on refining parsing techniques, exploring domain-specific metadata enhancements, and expanding the applicability of LLMs across diverse content types. This line of research may continue to disrupt traditional content promotion strategies by offering scalable and unbiased predictions for burgeoning content landscapes.