Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees
–Neural Information Processing Systems
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated optimal policy that best fits observed sequences of states and actions implemented by an expert.
Neural Information Processing Systems
Aug-14-2025, 10:34:51 GMT
- Country:
- North America > United States
- Texas > Brazos County
- College Station (0.14)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Illinois > Cook County
- Chicago (0.04)
- Texas > Brazos County
- Asia
- Middle East > Jordan (0.04)
- China
- Hong Kong (0.04)
- Guangdong Province > Shenzhen (0.04)
- North America > United States
- Genre:
- Research Report (0.46)
- Industry:
- Education (0.68)