Maximum-Likelihood InverseReinforcementLearning withFinite-TimeGuarantees
–Neural Information Processing Systems
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated optimal policy that best fits observed sequences of states and actions implemented byanexpert.
Neural Information Processing Systems
Feb-8-2026, 14:03:46 GMT
- Country:
- Asia
- China > Guangdong Province
- Shenzhen (0.04)
- Middle East > Jordan (0.04)
- China > Guangdong Province
- North America > United States
- Illinois > Cook County
- Chicago (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Texas > Brazos County
- College Station (0.04)
- Illinois > Cook County
- Asia