Why does Throwing Away Data Improve Worst-Group Error?
