Whitening and second order optimization both destroy information about the dataset, and can make generalization impossible