A Deep Reinforcement Learning Framework for Optimizing Congestion Control in Data Centers