Diving into the shallows: a computational perspective on large-scale shallow learning

Oct-8-2024, 07:56:25 GMT–Neural Information Processing Systems

Remarkable recent success of deep neural networks has not been easy to analyze theoretically. It has been particularly hard to disentangle relative significance of architecture and optimization in achieving accurate classification on large datasets. On the flip side, shallow methods (such as kernel methods) have encountered obstacles in scaling to large data, despite excellent performance on smaller datasets, and extensive theoretical analysis. Practical methods, such as variants of gradient descent used so successfully in deep learning, seem to perform below par when applied to kernel methods. This difficulty has sometimes been attributed to the limitations of shallow architecture. In this paper we identify a basic limitation in gradient descent-based optimization methods when used in conjunctions with smooth kernels.

artificial intelligence, iteration, machine learning, (16 more...)

Neural Information Processing Systems

Oct-8-2024, 07:56:25 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks > Deep Learning (0.86)
  - Statistical Learning > Gradient Descent (0.75)

Duplicate Docs Excel Report

Title
Diving into the shallows: a computational perspective on large-scale shallow learning
Diving into the shallows: a computational perspective on large-scale shallow learning

Similar Docs Excel Report more

Title	Similarity	Source
None found