Laxity-Aware Scalable Reinforcement Learning for HVAC Control