Robust Generalization despite Distribution Shift via Minimum Discriminating Information

Neural Information Processing Systems 

Consider building a model for diagnosing a specific heart disease, and suppose that most participants of the study are middle to high-aged men.