On Gap-dependent Bounds for Offline Reinforcement Learning

Aug-15-2025, 05:35:47 GMT–Neural Information Processing Systems

Instead, we have access to a dataset generated from some past suboptimal policies.

assumption, optimal policy, reinforcement learning, (11 more...)

Neural Information Processing Systems

Aug-15-2025, 05:35:47 GMT

Conferences PDF

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
On Gap-dependent Boundsfor Offline Reinforcement Learning

Similar Docs Excel Report more

Title	Similarity	Source
None found