MiWaves Reinforcement Learning Algorithm