Online reinforcement learning via sparse Gaussian mixture model Q-functions