Bi-CL: A Reinforcement Learning Framework for Robots Coordination Through Bi-level Optimization