A Maintenance Planning Framework using Online and Offline Deep Reinforcement Learning