Reinforcement Learning with General Value Function Approximation: Provably Efficient Approach via Bounded Eluder Dimension

Neural Information Processing Systems 

In reinforcement learning (RL), we study how an agent maximizes the cumulative reward by interacting with an unknown environment. RL finds enormous applications in a wide variety of domains, e.g., robotics [

Duplicate Docs Excel Report

Similar Docs  Excel Report  more

TitleSimilaritySource
None found