Differential Description Length for Hyperparameter Selection in Machine Learning