Rethink the Connections among Generalization, Memorization and the Spectral Bias of DNNs
Zhang, Xiao, Xiong, Haoyi, Wu, Dongrui
Over-parameterized deep neural networks (DNNs) with sufficient capacity to memorize random noise can achieve excellent generalization performance, challenging the bias-variance trade-off in classical learning theory. Recent studies claimed that DNNs first learn simple patterns and then memorize noise; some other works showed a phenomenon that DNNs have a spectral bias to learn target functions from low to high frequencies during training. However, we show that the monotonicity of the learning bias does not always hold: under the experimental setup of deep double descent, the high-frequency components of DNNs diminish in the late stage of training, leading to the second descent of the test error. Besides, we find that the spectrum of DNNs can be applied to indicating the second descent of the test error, even though it is calculated from the training set only.
Jun-5-2021
- Country:
- Oceania > Australia
- New South Wales > Sydney (0.04)
- North America
- United States
- Nevada > Clark County
- Las Vegas (0.04)
- California
- San Diego County > San Diego (0.04)
- Los Angeles County > Long Beach (0.04)
- Nevada > Clark County
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- France (0.04)
- Spain
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Andalusia > Granada Province
- Granada (0.04)
- Catalonia > Barcelona Province
- Asia > China
- Hubei Province > Wuhan (0.04)
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Oceania > Australia
- Genre:
- Research Report
- New Finding (0.46)
- Experimental Study (0.34)
- Research Report
- Technology: