Distilling a Neural Network Into a Soft Decision Tree