Multi-Objective Intrinsic Reward Learning for Conversational Recommender Systems
–Neural Information Processing Systems
Conversational Recommender Systems (CRS) actively elicit user preferences to generate adaptive recommendations. Mainstream reinforcement learning-based CRS solutions heavily rely on handcrafted reward functions, which may not be aligned with user intent in CRS tasks.
Neural Information Processing Systems
Oct-8-2025, 11:34:50 GMT