The unreasonable effectiveness of few-shot learning for machine translation