Optimizing Cyber Defense in Dynamic Active Directories through Reinforcement Learning