Nonparametric Gaussian mixture models for the multi-armed contextual bandit