Optimizing Load Scheduling in Power Grids Using Reinforcement Learning and Markov Decision Processes