A Lightweight Calibrated Simulation Enabling Efficient Offline Learning for Optimal Control of Real Buildings