Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control