Distributionally Robust Imitation Learning
Neural Information Processing Systems
Decision Process (MDP) setting where the reward function is not given, but demonstrations from experts are available. Although the goal of imitation learning is to learn a policy that produces behaviors nearly as good as the experts' for a desired task, assumptions of consistent optimality for demonstrated behaviors are often violated in practice.