On Entropy Control in LLM-RL Algorithms

Open in new window