Implicit Finite-Horizon Approximation and Efficient Optimal Algorithms for Stochastic Shortest Path
–Neural Information Processing Systems
Tarbouriech et al. [2020a] develop the first regret minimization algorithm for SSP with a regret bound
Neural Information Processing Systems
Aug-14-2025, 16:27:00 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- North America > United States
- California (0.14)
- Asia > Middle East
- Genre:
- Research Report (0.46)
- Technology: