Offline-Online Reinforcement Learning for Linear Mixture MDPs