A Family of Robust Stochastic Operators for Reinforcement Learning