Multi-Objective Intrinsic Reward Learning for Conversational Recommender Systems

Oct-11-2024, 07:07:05 GMT–Neural Information Processing Systems

Conversational Recommender Systems (CRS) actively elicit user preferences to generate adaptive recommendations. Mainstream reinforcement learning-based CRS solutions heavily rely on handcrafted reward functions, which may not be aligned with user intent in CRS tasks. Therefore, the design of task-specific rewards is critical to facilitate CRS policy learning, which remains largely under-explored in the literature. In this work, we propose a novel approach to address this challenge by learning intrinsic rewards from interactions with users. Specifically, we formulate intrinsic reward learning as a multi-objective bi-level optimization problem.

conversational recommender system, intrinsic reward, multi-objective intrinsic reward learning, (1 more...)

Neural Information Processing Systems

Oct-11-2024, 07:07:05 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report (0.44)

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)