Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium

Open in new window