Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs

Aug-17-2025, 02:58:02 GMT–Neural Information Processing Systems

More specifically, the discounted MDP is one of the standard MDPs in reinforcement learning to describe sequential tasks without interruption or restart.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

Neural Information Processing Systems

Aug-17-2025, 02:58:02 GMT

Conferences PDF

Country:
- North America > United States
  - California > Los Angeles County > Los Angeles (0.28)
- Europe > United Kingdom
  - England
    - Greater London > London (0.04)
    - Cambridgeshire > Cambridge (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
bb57db42f77807a9c5823bd8c2d9aaef-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found