Model-based Reinforcement Learning and the Eluder Dimension

Feb-8-2025, 18:06:04 GMT – Neural Information Processing Systems

We consider the problem of learning to optimize an unknown Markov decision process (MDP). We show that, if the MDP can be parameterized within some known function class, we can obtain regret bounds that scale with the dimensionality, rather than cardinality, of the system.

dimension, machine learning, reinforcement learning

Country:
- North America > United States
  - Massachusetts > Middlesex County
    - Belmont (0.04)
  - California > Santa Clara County
    - Palo Alto (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (0.68)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.48)
