Bayesian Design Principles for Offline-to-Online Reinforcement Learning