Explaining the physics of transfer learning a data-driven subgrid-scale closure to a different turbulent flow