A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control