eeb69a3cb92300456b6a5f4162093851-Paper.pdf

Feb-11-2026, 19:35:31 GMT–Neural Information Processing Systems

We study the Stochastic Shortest Path (SSP) problem in which an agent has to reach a goal state in minimum total expected cost. In the learning formulation ofthe problem, the agent has no prior knowledge about the costs and dynamics of the model. She repeatedly interacts with the model forK episodes, and has to minimize her regret. In this work we show that the minimax regret for this setting is eO( p (B2?+B?)|S||A|K)whereB?

artificial intelligence, arxivpreprintarxiv, machine learning, (16 more...)

Neural Information Processing Systems

Feb-11-2026, 19:35:31 GMT

Conferences PDF

Add feedback

Country:
- North America
  - United States > Nevada (0.04)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Asia > Middle East
  - Israel
    - Tel Aviv District > Tel Aviv (0.04)
    - Haifa District > Haifa (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (0.70)
  - Representation & Reasoning (0.67)

Duplicate Docs Excel Report

Title
Minimax Regret for Stochastic Shortest Path

Similar Docs Excel Report more

Title	Similarity	Source
None found