Action-Free Offline-to-Online RL via Discretised State Policies