Model-Based Exploration in Monitored Markov Decision Processes