Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees
–Neural Information Processing Systems
Inverse reinforcement learning (IRL) aims to recover the reward function and the associated optimal policy that best fits observed sequences of states and actions implemented by an expert.
Neural Information Processing Systems
Aug-14-2025, 10:34:51 GMT
- Country:
- Asia
- China
- Guangdong Province > Shenzhen (0.04)
- Hong Kong (0.04)
- Middle East > Jordan (0.04)
- China
- North America > United States
- Illinois > Cook County
- Chicago (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Texas > Brazos County
- College Station (0.14)
- Illinois > Cook County
- Asia
- Genre:
- Research Report (0.46)
- Industry:
- Education (0.68)