Agent-GSPO: Communication-Efficient Multi-Agent Systems via Group Sequence Policy Optimization
