Multi-Step Generalized Policy Improvement by Leveraging Approximate Models

Lucas N. Alegre, Ana L. C. Bazzan, Ann Nowé, Bruno C. da Silva

Feb-15-2026–Neural Information Processing Systems

We introduce a principled method for performing zero-shot transfer in reinforcement learning (RL) by exploiting approximate models of the environment. Zero-shot transfer in RL has been investigated by leveraging methods rooted in generalized policy improvement (GPI) and successor features (SFs).
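For readers unfamiliar with the GPI and SF background mentioned above, the following is a minimal sketch of standard one-step GPI action selection with successor features, not the multi-step, model-based method this paper introduces. It assumes the usual SF setting in which rewards are linear in some features, so that each known policy's action values are the dot product of its successor features with a task weight vector. The names `gpi_action`, `psi`, and `w`, and all numbers, are hypothetical and chosen only for illustration.

```python
import numpy as np

def gpi_action(psi, w):
    """Pick an action by standard GPI over a set of known policies.

    psi: array of shape (n_policies, n_actions, d), the successor features
         of each policy for every action in the current state (hypothetical).
    w:   array of shape (d,), the reward weights of the new task (hypothetical).
    """
    # Q-values of every policy on the new task: q[i, a] = psi[i, a] . w
    q = psi @ w
    # GPI: for each action take the best value over policies, then act greedily.
    return int(np.argmax(q.max(axis=0)))

# Two policies, two actions, two-dimensional features (made-up values).
psi = np.array([
    [[1.0, 0.0], [0.0, 1.0]],   # successor features of policy 0
    [[0.5, 0.5], [0.2, 2.0]],   # successor features of policy 1
])

action_task_a = gpi_action(psi, np.array([1.0, 0.0]))
action_task_b = gpi_action(psi, np.array([0.0, 1.0]))
```

Because only `w` changes between tasks, the same stored successor features can be reused to act on a new task without further learning, which is the sense in which this family of methods is called zero-shot.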

artificial intelligence, machine learning, reinforcement learning

Country:
- South America > Brazil
  - Rio Grande do Sul (0.04)
- North America
  - Puerto Rico (0.04)
  - United States
    - Massachusetts > Middlesex County
      - Belmont (0.04)
    - California > San Diego County
      - San Diego (0.04)
- Europe
  - Portugal (0.04)
  - Belgium > Flanders (0.04)
  - Sweden > Stockholm
    - Stockholm (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - Macao (0.04)
  - China (0.04)
  - Japan > Honshū
    - Chūbu > Toyama Prefecture > Toyama (0.04)

Genre:
- Research Report > New Finding (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Representation & Reasoning (1.00)
  - Machine Learning > Reinforcement Learning (1.00)
