Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations

Jan-19-2025, 20:47:21 GMT–Neural Information Processing Systems

Inverse Constraint Reinforcement Learning (ICRL) aims to recover the underlying constraints respected by expert agents in a data-driven manner. Existing ICRL algorithms typically assume that the demonstration data is generated by a single type of expert. However, in practice, demonstrations often comprise a mixture of trajectories collected from various expert agents respecting different constraints, making it challenging to explain expert behaviors with a unified constraint function. To tackle this issue, we propose a Multi-Modal Inverse Constrained Reinforcement Learning (MMICRL) algorithm for simultaneously estimating multiple constraints corresponding to different types of experts. MMICRL constructs a flow-based density estimator that enables unsupervised expert identification from demonstrations, so as to infer the agent-specific constraints.

constraint, demonstration, multi-modal inverse constrained reinforcement learning, (3 more...)

Neural Information Processing Systems

Jan-19-2025, 20:47:21 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (0.92)
  - Machine Learning > Reinforcement Learning (0.89)