Solving Zero-Sum Markov Games with Continuous State via Spectral Dynamic Embedding Chenhao Zhou

Oct-10-2025, 07:24:59 GMT–Neural Information Processing Systems

In this paper, we propose a provably efficient natural policy gradient algorithm called Spectral Dynamic Embedding Policy Optimization ( SDEPO) for two-player zero-sum stochastic Markov games with continuous state space and finite action space. In the policy evaluation procedure of our algorithm, a novel kernel embedding method is employed to construct a finite-dimensional linear approximations to the state-action value function.

algorithm, approximation, approximation error, (16 more...)

Neural Information Processing Systems

Oct-10-2025, 07:24:59 GMT

Conferences PDF

Add feedback

Country:
- Europe > Switzerland
  - Zürich > Zürich (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - China (0.04)

Genre:
- Research Report
  - Experimental Study (1.00)
  - New Finding (0.67)

Industry:
- Information Technology (0.45)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (1.00)
  - Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
Solving Zero-Sum Markov Games with Continuous State via Spectral Dynamic Embedding Chenhao Zhou

Similar Docs Excel Report more

Title	Similarity	Source
None found