Understanding training and generalization in deep learning by Fourier analysis

Aug-13-2018–arXiv.org Machine Learning

Background: Why Deep Neural Networks (DNNs), which have many more parameters than training data and are trained by (stochastic) gradient-based methods, often achieve remarkably low generalization error is still an open theoretical question. Contribution: We study DNN training by Fourier analysis. Our theoretical framework explains: i) a DNN trained with (stochastic) gradient-based methods gives the low-frequency components of the target function a higher priority during training; ii) small initialization leads to good generalization ability of the DNN while preserving its ability to fit any function. These results are further confirmed by experiments in which DNNs fit natural images, one-dimensional functions, and the MNIST dataset.
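As a rough illustration of claim i), one way to check whether low-frequency components are fitted first is to compare the discrete Fourier transform of a model's output with that of the target, frequency by frequency. The sketch below is not taken from the paper: the helper `relative_frequency_error`, the two-sine target, and the stand-in `early_output` (used here in place of a real network output early in training) are all hypothetical choices made for illustration.

```python
import numpy as np

def relative_frequency_error(target, output, freqs):
    """Relative DFT error |F[output](k) - F[target](k)| / |F[target](k)| for each index k."""
    t_hat = np.fft.rfft(target)
    o_hat = np.fft.rfft(output)
    return {k: float(abs(o_hat[k] - t_hat[k]) / abs(t_hat[k])) for k in freqs}

# Hypothetical 1-D target on a uniform grid: one low- and one high-frequency sine.
n = 256
x = np.arange(n) / n
target = np.sin(2 * np.pi * 1 * x) + 0.5 * np.sin(2 * np.pi * 20 * x)

# Assumed stand-in for a network output early in training:
# only the low-frequency component has been captured so far.
early_output = np.sin(2 * np.pi * 1 * x)

errors = relative_frequency_error(target, early_output, freqs=[1, 20])
print(errors)  # error near 0 at k=1, near 1 at k=20
```

In an actual experiment, `early_output` would be replaced by the network's predictions on the grid at successive training steps; under the abstract's claim, the error at low frequency indices would be expected to shrink before the error at high ones.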

artificial intelligence, dnn, machine learning

Country:
- North America > United States
  - New York (0.04)
- Asia > Middle East
  - UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning > Gradient Descent (1.00)
  - Neural Networks > Deep Learning (0.86)
