Changing Model Behavior at Test-Time Using Reinforcement Learning