MAD for Robust Reinforcement Learning in Machine Translation