On Exploration, Exploitation and Learning in Adaptive Importance Sampling