BitParticle: Partializing Sparse Dual-Factors to Build Quasi-Synchronizing MAC Arrays for Energy-efficient DNNs
Qiaoyuan, Feilong, Wang, Jihe, Sun, Zhiyu, Wu, Linying, Xiao, Yuanhua, Wang, Danghui
–arXiv.org Artificial Intelligence
--Bit-level sparsity in quantized deep neural networks (DNNs) offers significant potential for optimizing Multiply-Accumulate (MAC) operations. However, two key challenges still limit its practical exploitation. Methods designed to exploit dual-factor sparsity are still in the early stages of exploration, facing the challenge of partial product explosion. Second, the fluctuation of bit-level sparsity leads to variable cycle counts for MAC operations. Existing synchronous scheduling schemes that are suitable for dual-factor sparsity exhibit poor flexibility and still result in significant underutilization of MAC units. T o address the first challenge, this study proposes a MAC unit that leverages dual-factor sparsity through the emerging particlization-based approach. The proposed design addresses the issue of partial product explosion through simple control logic, resulting in a more area-and energy-efficient MAC unit. In addition, by discarding less significant intermediate results, the design allows for further hardware simplification at the cost of minor accuracy loss. T o address the second challenge, a quasi-synchronous scheme is introduced that adds cycle-level elasticity to the MAC array, reducing pipeline stalls and thereby improving MAC unit utilization. Evaluation results show that the exact version of the proposed MAC array architecture achieves a 29.2% improvement in area efficiency compared to the state-of-the-art bit-sparsity-driven architecture, while maintaining comparable energy efficiency. The approximate variant further improves energy efficiency by 7.5%, compared to the exact version. Due to the limited computing power of edge devices, deploying fixed-point quantized models in edge DNN architectures has become a common practice [1], [2].
arXiv.org Artificial Intelligence
Jul-15-2025
- Country:
- Asia > China
- Shaanxi Province > Xi'an (0.04)
- North America > United States
- Florida > Broward County > Fort Lauderdale (0.04)
- Asia > China
- Genre:
- Research Report > New Finding (0.48)
- Technology: