Model Based Residual Policy Learning with Applications to Antenna Control