A Finite-Sample Analysis of Payoff-Based Independent Learning in Zero-Sum Stochastic Games

Feb-11-2025, 15:59:13 GMT–Neural Information Processing Systems

In this work, we study two-player zero-sum stochastic games and develop a variant of the smoothed best-response learning dynamics that combines independent learning dynamics for matrix games with the minimax value iteration for stochastic games. The resulting learning dynamics are payoff-based, convergent, rational, and symmetric between the two players.
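As a rough illustration of the matrix-game ingredient mentioned above, the sketch below runs payoff-based smoothed best-response dynamics in a single zero-sum matrix game. It is not the paper's algorithm: the softmax form of the smoothed best response, the function names, the step sizes `alpha` and `beta`, and the temperature `tau` are all assumptions chosen for illustration, and the minimax value iteration layer for stochastic games is omitted entirely.

```python
import math
import random


def softmax(q, tau):
    """Smoothed best response (assumed form): softmax of q with temperature tau."""
    m = max(q)  # subtract the max for numerical stability
    w = [math.exp((v - m) / tau) for v in q]
    s = sum(w)
    return [x / s for x in w]


def sample(policy, rng):
    """Draw an action index from a probability vector."""
    return rng.choices(range(len(policy)), weights=policy, k=1)[0]


def run_matrix_game(payoff, steps=5000, tau=0.5, alpha=0.05, beta=0.01, seed=0):
    """Hypothetical two-timescale dynamics for a zero-sum matrix game.

    payoff[a1][a2] is player 1's reward; player 2 receives its negative.
    Each player sees only its own realized payoff (payoff-based) and never
    observes the opponent's action or policy (independent).
    """
    rng = random.Random(seed)
    n1, n2 = len(payoff), len(payoff[0])
    q1, q2 = [0.0] * n1, [0.0] * n2
    pi1, pi2 = [1.0 / n1] * n1, [1.0 / n2] * n2
    for _ in range(steps):
        a1, a2 = sample(pi1, rng), sample(pi2, rng)
        r1 = payoff[a1][a2]
        r2 = -r1
        # Update the value estimate of the played action only.
        q1[a1] += alpha * (r1 - q1[a1])
        q2[a2] += alpha * (r2 - q2[a2])
        # Move each policy a small step toward its smoothed best response.
        br1, br2 = softmax(q1, tau), softmax(q2, tau)
        pi1 = [p + beta * (b - p) for p, b in zip(pi1, br1)]
        pi2 = [p + beta * (b - p) for p, b in zip(pi2, br2)]
    return pi1, pi2


if __name__ == "__main__":
    matching_pennies = [[1.0, -1.0], [-1.0, 1.0]]
    print(run_matrix_game(matching_pennies))
```

Both players run the same update rule, which mirrors the symmetry property the abstract claims; the sketch makes no claim about convergence rates.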

artificial intelligence, machine learning, reinforcement learning

Genre:
- Overview (0.45)