Between Instruction and Reward: Human-Prompted Switching