Learning a Robust Multiagent Driving Policy for Traffic Congestion Reduction