A Deployed Online Reinforcement Learning Algorithm In An Oral Health Clinical Trial