Multi-Objective Intrinsic Reward Learning for Conversational Recommender Systems
–Neural Information Processing Systems
Conversational Recommender Systems (CRS) actively elicit user preferences to generate adaptive recommendations. Mainstream reinforcement learning-based CRS solutions heavily rely on handcrafted reward functions, which may not be aligned with user intent in CRS tasks.
Neural Information Processing Systems
Nov-15-2025, 08:03:01 GMT
- Country:
- North America > United States
- California > Santa Clara County
- Los Gatos (0.04)
- Virginia (0.05)
- California > Santa Clara County
- North America > United States
- Genre:
- Research Report > New Finding (0.46)
- Technology: