RAPID: An Efficient Reinforcement Learning Algorithm for Small Language Models
