LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Open in new window