Sub-optimal Policy Aided Multi-Agent Reinforcement Learning for Flocking Control

Open in new window