Multi-Agent Guided Policy Optimization