Toward Holistic Evaluation of Recommender Systems Powered by Generative Models
Deldjoo, Yashar, Mehta, Nikhil, Sathiamoorthy, Maheswaran, Zhang, Shuai, Castells, Pablo, McAuley, Julian
–arXiv.org Artificial Intelligence
Recommender systems powered by generative models (Gen-RecSys) extend beyond classical item ranking by producing open-ended content, which simultaneously unlocks richer user experiences and introduces new risks. On one hand, these systems can enhance personalization and appeal through dynamic explanations and multi-turn dialogues. On the other hand, they might venture into unknown territory-hallucinating nonexistent items, amplifying bias, or leaking private information. Traditional accuracy metrics cannot fully capture these challenges, as they fail to measure factual correctness, content safety, or alignment with user intent. This paper makes two main contributions. First, we categorize the evaluation challenges of Gen-RecSys into two groups: (i) existing concerns that are exacerbated by generative outputs (e.g., bias, privacy) and (ii) entirely new risks (e.g., item hallucinations, contradictory explanations). Second, we propose a holistic evaluation approach that includes scenario-based assessments and multi-metric checks-incorporating relevance, factual grounding, bias detection, and policy compliance. Our goal is to provide a guiding framework so researchers and practitioners can thoroughly assess Gen-RecSys, ensuring effective personalization and responsible deployment.
arXiv.org Artificial Intelligence
Jul-11-2025
- Country:
- Asia > Myanmar
- Tanintharyi Region > Dawei (0.04)
- Europe
- Denmark > Capital Region
- Copenhagen (0.05)
- Italy > Apulia
- Bari (0.04)
- Spain > Galicia
- Madrid (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Denmark > Capital Region
- North America
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- California > San Diego County
- San Diego (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York > New York County
- New York City (0.04)
- North Carolina > Durham County
- Durham (0.04)
- California > San Diego County
- Mexico > Mexico City
- Asia > Myanmar
- Genre:
- Overview (0.93)
- Industry:
- Information Technology (0.46)
- Media (0.46)
- Transportation (0.46)
- Technology: