Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks