NeuraLUT-Assemble: Hardware-aware Assembling of Sub-Neural Networks for Efficient LUT Inference
Andronic, Marta, Constantinides, George A.
–arXiv.org Artificial Intelligence
--Efficient neural networks (NNs) leveraging lookup tables (LUTs) have demonstrated significant potential for emerging AI applications, particularly when deployed on field-programmable gate arrays (FPGAs) for edge computing. These architectures promise ultra-low latency and reduced resource utilization, broadening neural network adoption in fields such as particle physics. However, existing LUT -based designs suffer from accuracy degradation due to the large fan-in required by neurons being limited by the exponential scaling of LUT resources with input width. In practice, in prior work this tension has resulted in the reliance on extremely sparse models. We present NeuraLUT -Assemble, a novel framework that addresses these limitations by combining mixed-precision techniques with the assembly of larger neurons from smaller units, thereby increasing connectivity while keeping the number of inputs of any given LUT manageable. Additionally, we introduce skip-connections across entire LUT structures to improve gradient flow. NeuraLUT -Assemble closes the accuracy gap between LUT -based methods and (fully-connected) MLP-based models, achieving competitive accuracy on tasks such as network intrusion detection, digit classification, and jet classification, demonstrating up to 8 . Ultra-low latency NN inference has become instrumental in advancing fields such as particle physics, network security, and autonomous vehicles. In particle physics, machine learning (ML) models are essential for handling the immense data volumes generated by detectors.
arXiv.org Artificial Intelligence
Apr-1-2025
- Country:
- North America > United States
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California
- Santa Clara County > San Jose (0.04)
- San Diego County > San Diego (0.04)
- Monterey County > Monterey (0.04)
- Louisiana > Orleans Parish
- Europe
- Italy > Sardinia (0.04)
- France (0.04)
- United Kingdom > England
- Greater London > London (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- Asia > Japan
- Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- North America > United States
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology > Security & Privacy (0.68)
- Technology: