Differential Description Length for Hyperparameter Selection in Machine Learning

Host-Madsen, Anders; Abolfazli, Mojtaba; Zhang, June

Feb-12-2019–arXiv.org Machine Learning

This paper introduces a new method for model selection and, more generally, hyperparameter selection in machine learning. The paper first proves a relationship between generalization error and a difference of description lengths of the training data; we call this difference the differential description length (DDL). This relationship allows the generalization error to be predicted from the training data alone, by encoding the training data. Model selection then amounts to choosing the model with the smallest predicted generalization error. We show how this encoding can be done for linear regression and neural networks. We provide experiments showing that this leads to smaller generalization error than cross-validation and traditional MDL and Bayes methods.
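To make the idea of a "difference of description lengths" concrete, the following is a minimal sketch, not the paper's estimator. It assumes a plug-in Gaussian predictive (prequential) code for linear regression, in which each sample is encoded with a model fitted to the samples before it. The function names `prequential_codelength`, `ddl_per_sample`, and `select_degree`, the split point `m`, and the small `ridge` term are all hypothetical choices made for illustration; the abstract does not specify the encoders used in the paper.

```python
import numpy as np


def prequential_codelength(X, y, ridge=1e-8):
    """Codelength (in nats) of y given X under a plug-in Gaussian predictive code.

    Sample i is encoded with a linear-Gaussian model fitted to samples 0..i-1.
    The first d + 1 samples are skipped; they are assumed to be sent with a
    fixed code whose cost cancels in the difference taken below.
    """
    n, d = X.shape
    total = 0.0
    for i in range(d + 1, n):
        X_past, y_past = X[:i], y[:i]
        # Least-squares fit on the past samples (tiny ridge for stability).
        gram = X_past.T @ X_past + ridge * np.eye(d)
        w = np.linalg.solve(gram, X_past.T @ y_past)
        resid = y_past - X_past @ w
        var = max(float(resid @ resid) / i, 1e-12)
        # Negative log-density of the next sample under the fitted model.
        err = float(y[i] - X[i] @ w)
        total += 0.5 * np.log(2.0 * np.pi * var) + err**2 / (2.0 * var)
    return total


def ddl_per_sample(X, y, m):
    """Codelength of all n samples minus codelength of the first m, per sample."""
    n, d = X.shape
    if not d < m < n:
        raise ValueError("need d < m < n")
    diff = prequential_codelength(X, y) - prequential_codelength(X[:m], y[:m])
    return diff / (n - m)


def select_degree(x, y, degrees, m):
    """Pick the polynomial degree with the smallest per-sample difference."""
    scores = {
        deg: ddl_per_sample(np.vander(x, deg + 1, increasing=True), y, m)
        for deg in degrees
    }
    return min(scores, key=scores.get), scores
```

A call such as `select_degree(x, y, degrees=range(6), m=50)` returns the chosen degree together with the per-degree scores. In this sketch the score is an average log-loss on samples `m` through `n - 1`, each predicted from earlier samples only, which is why it can serve as a proxy for generalization error without a held-out set.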

codelength, description length, generalization error, (14 more...)


Country:
- North America > United States
  - Colorado (0.04)
  - New York > New York County
    - New York City (0.04)
  - New Jersey > Mercer County
    - Princeton (0.04)
  - Hawaii > Honolulu County
    - Honolulu (0.04)
- Europe > Sweden
  - Stockholm > Stockholm (0.04)

Genre:
- Research Report (0.82)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.47)
  - Machine Learning
    - Statistical Learning (1.00)
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (0.47)
