Why don't we test machine learning as we test software?