Regret bounds for Narendra-Shapiro bandit algorithms

Open in new window