Towards On-Device Learning and Reconfigurable Hardware Implementation for Encoded Single-Photon Signal Processing
Zang, Zhenya, Li, Xingda, Li, David Day Uei
–arXiv.org Artificial Intelligence
--Deep neural networks (DNNs) enhance the accuracy and efficiency of reconstructing key parameters from time-resolved photon arrival signals recorded by single-photon detectors. However, the performance of conventional backpropagation-based DNNs is highly dependent on various parameters of the optical setup and biological samples under examination, necessitating frequent network retraining--either through transfer learning or from scratch. Newly collected data must also be stored and transferred to a high-performance GPU server for retraining, introducing latency and storage overhead. T o address these challenges, we propose an online training algorithm based on a One-Sided Jacobi rotation-based Online Sequential Extreme Learning Machine (OSOS-ELM). We fully exploit parallelism in executing OSOS-ELM on a heterogeneous FPGA with integrated ARM cores. Extensive evaluations of OSOS-ELM and OS-ELM demonstrate that both achieve comparable accuracy across different network dimensions (i.e., input, hidden, and output layers), while OSOS-ELM proves to be more hardware-efficient. By leveraging the parallelism of OSOS-ELM, we implement a holistic computing prototype on a Xilinx ZCU104 FPGA, which integrates a multi-core CPU and programmable logic fabric. We also implement our OSOS-ELM on the NVIDIA Jetson Xavier NX GPU to comprehensively investigate its computing performance on another type of heterogeneous computing platform. N-device training of neural networks has been emerging in recent decades. On-device training and inference save the overhead of data transfer to data centers, memory management, and computing on the cloud. The number of edge devices is increasing exponentially and is expected to reach 1 trillion by 2035 [1]. Latency tends to be a bottleneck of real-time applications such as healthcare and machine automation. Additionally, information privacy can be threatened when uploading and offloading sensitive biomedical data to the cloud. This work is supported by the EPSRC (EP/T00097X/1); the Quantum Technology Hub in Quantum Imaging (QuantiC), and the University of Strathclyde. Xingda Li also acknowledges support from China Scholarship Council.
arXiv.org Artificial Intelligence
Apr-15-2025
- Country:
- Asia > China (0.24)
- Europe
- North America > United States (0.28)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Technology: