Near-optimal Regret Bounds for Stochastic Shortest Path
