Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path

Open in new window