HRM-Agent: Training a recurrent reasoning model in dynamic environments using reinforcement learning
