Breaking the Activation Function Bottleneck through Adaptive Parameterization

Sebastian Flennerhag, Hujun Yin, John Keane, Mark Elliot

Oct-8-2024, 03:59:12 GMT–Neural Information Processing Systems

Standard neural network architectures are non-linear only by virtue of a simple element-wise activation function, making them both brittle and excessively large. In this paper, we consider methods for making the feed-forward layer more flexible while preserving its basic structure. We develop simple drop-in replacements that learn to adapt their parameterization conditional on the input, thereby increasing statistical efficiency significantly.

artificial intelligence, deep learning, machine learning, (15 more...)

Neural Information Processing Systems

Oct-8-2024, 03:59:12 GMT

Conferences PDF

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
Breaking the Activation Function Bottleneck through Adaptive Parameterization
Breaking the Activation Function Bottleneck through Adaptive Parameterization

Similar Docs Excel Report more

Title	Similarity	Source
None found