Learning General World Models in a Handful of Reward-Free Deployments

Yingchen Xu (UCL, FAIR), Jack Parker-Holder (University of Oxford), Aldo Pacchiano (Microsoft Research), Philip J. Ball

Aug-17-2025, 14:10:15 GMT – Neural Information Processing Systems

Combining these two properties, we introduce the reward-free deployment efficiency setting, a new paradigm for RL research.

artificial intelligence, international conference, machine learning

Country:
- North America > United States
  - Massachusetts > Middlesex County > Cambridge (0.04)
- Europe
  - France (0.04)
  - United Kingdom > England
    - Oxfordshire > Oxford (0.40)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (0.93)
- Overview (0.67)

Industry:
- Education (0.68)
- Leisure & Entertainment > Games (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Cognitive Science > Problem Solving (0.84)
  - Machine Learning > Neural Networks
    - Deep Learning (0.68)
