Robust Policy Expansion for Offline-to-Online RL under Diverse Data Corruption