Measuring Model Performance in the Presence of an Intervention