Deep Reinforcement Learning for Cyber System Defense under Dynamic Adversarial Uncertainties