Optimizing Neural Networks with Learnable Non-Linear Activation Functions via Lookup-Based FPGA Acceleration

Yin, Mengyuan, Choong, Benjamin Chen Ming, Qu, Chuping, Goh, Rick Siow Mong, Wong, Weng-Fai, Luo, Tao

Aug-26-2025–arXiv.org Artificial Intelligence

--Learned activation functions in models like Kolmogorov-Arnold Networks (KANs) outperform fixed-activation architectures in terms of accuracy and interpretability; however, their computational complexity poses critical challenges for energy-constrained edge AI deployments. Conventional CPUs/GPUs incur prohibitive latency and power costs when evaluating higher order activations, limiting deployability under ultra-tight energy budgets. We address this via a reconfigurable lookup architecture with edge FPGAs. FPGA reconfigurability enables dynamic hardware specialization for learned functions, a key advantage for edge systems that require post-deployment adaptability. This breakthrough positions our approach as a practical enabler for energy-critical edge AI, where computational intensity and power constraints traditionally preclude the use of adaptive activation networks. The development of effective activation functions has long been a central focus in machine learning research to enhance neural network capabilities. Neural networks with trainable activation functions represent an important and actively explored class of models, attracting growing research interest due to their potential to enhance model expressivity and adaptability to specific tasks [1] - complementing models with traditional fixed functions such as ReLU [2] and Leaky ReLU [3]. Learnable activation functions can be classified into two main categories: parameterized standard activation functions and ensemble-based activation functions [4].

activation function, artificial intelligence, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Aug-26-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.28)

Genre:
- Research Report (0.50)

Industry:
- Energy (0.48)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)