Reviews: Repeated Inverse Reinforcement Learning

Oct-8-2024, 03:46:41 GMT – Neural Information Processing Systems

The authors present a learning framework for inverse reinforcement learning in which an agent proposes policies for a variety of related tasks and a human determines whether each proposed policy is acceptable. They present algorithms for learning the human's latent reward function across the tasks, and they give upper and lower bounds on the performance of these algorithms. They also address the setting where an agent is "corrected" as it executes trajectories. This is a comprehensive theoretical treatment of a new conceptualization of IRL that I think is valuable. I have broad clarification/scoping questions and a few minor points.

algorithm, optimal policy, repeated inverse reinforcement learning

Genre:
- Summary/Review (0.74)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)