Research Papers based on Bottlenecks in Deep Learning Models

Jun-11-2022, 08:20:23 GMT–#artificialintelligence

Abstract: Deep learning has become the most powerful machine learning tool in the last decade. However, how to efficiently train deep neural networks remains to be thoroughly solved. The widely used minibatch stochastic gradient descent (SGD) still needs to be accelerated. As a promising tool to better understand the learning dynamic of minibatch SGD, the information bottleneck (IB) theory claims that the optimization process consists of an initial fitting phase and the following compression phase. Based on this principle, we further study typicality sampling, an efficient data selection method, and propose a new explanation of how it helps accelerate the training process of the deep networks.

accelerator, bottleneck, interaction, (13 more...)

#artificialintelligence

Jun-11-2022, 08:20:23 GMT

News Web Page

Add feedback

Genre:
- Research Report (0.30)
- Overview (0.30)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found