Network-wide traffic signal control optimization using a multi-agent deep reinforcement learning