Finite-Sample Analysis for SARSA with Linear Function Approximation
Shaofeng Zou, Tengyu Xu, Yingbin Liang
–Neural Information Processing Systems
SARSA is an on-policy algorithm to learn a Markov decision process policy in reinforcement learning. We investigate the SARSA algorithm with linear function approximation under the non-i.i.d.
Neural Information Processing Systems
Oct-3-2025, 08:27:49 GMT
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America
- Canada (0.04)
- United States
- New York > Erie County
- Buffalo (0.04)
- Ohio > Franklin County
- Columbus (0.04)
- New York > Erie County
- Europe > United Kingdom