SEM: Reinforcement Learning for Search-Efficient Large Language Models

Open in new window