Stochastic matrix games with bandit feedback