Finite-Sample Analysis for SARSA with Linear Function Approximation

Dec-25-2025, 19:32:24 GMT–Neural Information Processing Systems

SARSA is an on-policy algorithm to learn a Markov decision process policy in reinforcement learning. We investigate the SARSA algorithm with linear function approximation under the non-i.i.d.\ setting, where a single sample trajectory is available. With a Lipschitz continuous policy improvement operator that is smooth enough, SARSA has been shown to converge asymptotically. However, its non-asymptotic analysis is challenging and remains unsolved due to the non-i.i.d.

algorithm, finite-sample analysis, sarsa algorithm, (4 more...)

Neural Information Processing Systems

Dec-25-2025, 19:32:24 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.97)