Multi-Objective Intrinsic Reward Learning for Conversational Recommender Systems

Neural Information Processing Systems 

Conversational Recommender Systems (CRS) actively elicit user preferences to generate adaptive recommendations. Mainstream reinforcement learning-based CRS solutions heavily rely on handcrafted reward functions, which may not be aligned with user intent in CRS tasks.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found