Error Bounds of Imitating Policies and Environments
Neural Information Processing Systems
Imitation learning trains a policy by mimicking expert demonstrations. Various imitation methods have been proposed and empirically evaluated, but their theoretical understanding needs further study. In this paper, we first analyze the value gap between the expert policy and the policies imitated by two methods, behavioral cloning and generative adversarial imitation.
Nov-20-2025, 09:32:53 GMT
- Country:
- Asia > China > Guangdong Province > Shenzhen (0.04)
- Asia > China > Hong Kong (0.04)
- Asia > China > Jiangsu Province > Nanjing (0.04)
- Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
- Asia > Middle East > Jordan (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- North America > Canada (0.04)
- Technology: