Adaptive Thinking via Mode Policy Optimization for Social Language Agents
