Multi-Step Generalized Policy Improvement by Leveraging Approximate Models Lucas N. Alegre 1, 2 Ana L. C. Bazzan 1 Ann Now é 2 Bruno C. da Silva 3 1
–Neural Information Processing Systems
We introduce a principled method for performing zero-shot transfer in reinforcement learning (RL) by exploiting approximate models of the environment. Zero-shot transfer in RL has been investigated by leveraging methods rooted in generalized policy improvement (GPI) and successor features (SFs).
Neural Information Processing Systems
Feb-15-2026
- Country:
- Asia
- China (0.04)
- Japan > Honshū
- Chūbu > Toyama Prefecture > Toyama (0.04)
- Macao (0.04)
- Middle East > Jordan (0.04)
- Europe
- North America
- Puerto Rico (0.04)
- United States
- California > San Diego County
- San Diego (0.04)
- Massachusetts > Middlesex County
- Belmont (0.04)
- California > San Diego County
- South America > Brazil
- Rio Grande do Sul (0.04)
- Asia
- Genre:
- Research Report > New Finding (0.68)
- Technology: