Online Symbolic Music Alignment with Offline Reinforcement Learning