Effective Testing for Machine Learning (Part II)
Since the mean of the target variable decreased, the regression problem got easier. Picture the distribution of a numerical variable: a model predicting zero will have an MAE equal to the absolute mean of the distribution; now, imagine you add recently generated data that increases the concentration of your target variable even more (i.e., the mean decreases): if you evaluate the model that always predicts zero, the MAE will decrease, giving the impression that your new model got better! After meeting with business stakeholders, we found out that a recent change in the data source introduced spurious observations.
Apr-2-2022, 15:59:33 GMT
- Technology: