Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation

Oct-11-2024, 08:23:47 GMT–Neural Information Processing Systems

Learning new task-specific skills from a few trials is a fundamental challenge for artificial intelligence. Meta reinforcement learning (meta-RL) tackles this problem by learning transferable policies that support few-shot adaptation to unseen tasks. Despite recent advances in meta-RL, most existing methods require access to the environment's reward function for each new task in order to infer the task objective, which is unrealistic in many practical applications. To bridge this gap, we study few-shot adaptation in the context of human-in-the-loop reinforcement learning and develop a meta-RL algorithm that enables fast policy adaptation with preference-based feedback.
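To make "preference-based feedback" concrete, the following is a minimal sketch of how pairwise preferences, rather than reward values, could narrow down which task the agent is facing. It is not the algorithm from the paper. It assumes a Bradley-Terry preference model, a small finite set of candidate tasks, and known trajectory returns under each candidate; the names `preference_prob` and `update_task_posterior` and all numbers are hypothetical.

```python
import math


def preference_prob(return_a, return_b):
    """Bradley-Terry probability that trajectory A is preferred over B (assumed model)."""
    return 1.0 / (1.0 + math.exp(return_b - return_a))


def update_task_posterior(posterior, returns_a, returns_b, a_preferred):
    """Bayesian update over candidate tasks after one preference query.

    posterior:  probability of each candidate task before the query
    returns_a:  return of trajectory A under each candidate task
    returns_b:  return of trajectory B under each candidate task
    a_preferred: True if the human preferred trajectory A
    """
    weights = []
    for prior, ret_a, ret_b in zip(posterior, returns_a, returns_b):
        likelihood = preference_prob(ret_a, ret_b)
        if not a_preferred:
            likelihood = 1.0 - likelihood
        weights.append(prior * likelihood)
    total = sum(weights)
    return [w / total for w in weights]


# Three hypothetical candidate tasks with a uniform prior.
posterior = [1 / 3, 1 / 3, 1 / 3]
# Hypothetical returns of two trajectories under each candidate task.
returns_a = [2.0, 0.0, -1.0]
returns_b = [0.0, 0.0, 1.0]

# The human says "A is better"; no reward value is ever revealed.
posterior = update_task_posterior(posterior, returns_a, returns_b, a_preferred=True)
most_likely_task = max(range(len(posterior)), key=posterior.__getitem__)
```

In this toy setting, a single answer shifts probability toward the candidate task under which trajectory A scores higher, which illustrates why a handful of well-chosen queries can stand in for direct access to the reward function.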

efficient meta reinforcement learning, few-shot adaptation, preference-based fast adaptation

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)