Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics
–Neural Information Processing Systems
Activation functions are fundamental elements of deep learning architectures as they significantly influence training dynamics. ReLU, while widely used, is prone to the dying neuron problem, which has been mitigated by variants such as LeakyReLU, PReLU, and ELU that better handle negative neuron outputs. Recently, self-gated activations like GELU and Swish have emerged as state-of-the-art alternatives, leveraging their smoothness to ensure stable gradient flow and prevent neuron inactivity.
Neural Information Processing Systems
Jun-17-2026, 14:02:45 GMT
- Country:
- Europe > Germany > Baden-Württemberg (0.28)
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Research Report
- Industry:
- Government (0.67)
- Technology: