Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path

Neural Information Processing Systems 

Tarbouriech et al. [2020a] develop the first regret minimization algorithm for SSP with a regret bound

Similar Docs  Excel Report  more

TitleSimilaritySource
None found