Randomised Optimism via Competitive Co-Evolution for Matrix Games with Bandit Feedback

Open in new window