Manipulating Sparse Double Descent

Jan-19-2024–arXiv.org Artificial Intelligence

This paper investigates the double descent phenomenon in two-layer neural networks, focusing on the role of L1 regularization and representation dimensions. It explores an alternative double descent phenomenon, named'sparse double descent'. The study emphasizes the complex relationship between model complexity, sparsity, and generalization, and suggests further research into more diverse models and datasets. The findings contribute to a deeper understanding of neural network training and optimization.

double descent phenomenon, neural network, regression, (11 more...)

arXiv.org Artificial Intelligence

Jan-19-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York > New York County > New York City (0.05)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.05)

Genre:
- Research Report (0.91)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Statistical Learning (1.00)
  - Neural Networks > Deep Learning (0.30)