REFINE-LM: Mitigating Language Model Stereotypes via Reinforcement Learning