Anytime-Competitive Reinforcement Learning with Policy Prior