P1: Mastering Physics Olympiads with Reinforcement Learning