Genetic multi-armed bandits: a reinforcement learning approach for discrete optimization via simulation

Open in new window