Adaptive Thinking via Mode Policy Optimization for Social Language Agents