On Oracle-Efficient PAC RL with Rich Observations

Christoph Dann, Nan Jiang, Akshay Krishnamurthy, Alekh Agarwal, John Langford, Robert E. Schapire

Oct-7-2024, 10:59:30 GMT–Neural Information Processing Systems

We study the computational tractability of PAC reinforcement learning with rich observations. We present new provably sample-efficient algorithms for environments with deterministic hidden state dynamics and stochastic rich observations. These methods operate in an oracle model of computation--accessing policy and value function classes exclusively through standard optimization primitives--and therefore represent computationally efficient alternatives to prior algorithms that require enumeration.

artificial intelligence, machine learning, reinforcement learning, (12 more...)

Neural Information Processing Systems

Oct-7-2024, 10:59:30 GMT

Conferences PDF

Add feedback

Country:
- North America > United States > New York (0.15)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Computational Learning Theory (0.68)
    - Reinforcement Learning (0.91)
  - Representation & Reasoning
    - Optimization (1.00)
    - Search (0.68)

Duplicate Docs Excel Report

Title
On Oracle-Efficient PAC RL with Rich Observations
On Oracle-Efficient PAC RL with Rich Observations

Similar Docs Excel Report more

Title	Similarity	Source
None found