AITopics | Peng, Xian

Collaborating Authors

Peng, Xian

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

DistrEE: Distributed Early Exit of Deep Neural Network Inference on Edge Devices

Peng, Xian, Wu, Xin, Xu, Lianming, Wang, Li, Fei, Aiguo

arXiv.org Artificial IntelligenceFeb-6-2025

DistrEE: Distributed Early Exit of Deep Neural Network Inference on Edge Devices Xian Peng, Xin Wu, Lianming Xu, Li Wang and Aiguo Fei School of Computer Science (National Pilot Software Engineering School), Beijing University of Posts and Telecommunications, Beijing, China School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing, China Abstract --Distributed DNN inference is becoming increasingly important as the demand for intelligent services at the network edge grows. By leveraging the power of distributed computing, edge devices can perform complicated and resource-hungry inference tasks previously only possible on powerful servers, enabling new applications in areas such as autonomous vehicles, industrial automation, and smart homes. However, it is challenging to achieve accurate and efficient distributed edge inference due to the fluctuating nature of the actual resources of the devices and the processing difficulty of the input data. In this work, we propose DistrEE, a distributed DNN inference framework that can exit model inference early to meet specific quality of service requirements. In particular, the framework firstly integrates model early exit and distributed inference for multi-node collaborative inferencing scenarios. Furthermore, it designs an early exit policy to control when the model inference terminates.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2502.15735

Country: Asia > China > Beijing > Beijing (0.85)

Genre: Research Report (1.00)

Industry: Information Technology > Smart Houses & Appliances (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Failure-Resilient Distributed Inference with Model Compression over Heterogeneous Edge Devices

Wang, Li, Li, Liang, Xu, Lianming, Peng, Xian, Fei, Aiguo

arXiv.org Artificial IntelligenceJun-20-2024

The distributed inference paradigm enables the computation workload to be distributed across multiple devices, facilitating the implementations of deep learning based intelligent services on extremely resource-constrained Internet of Things (IoT) scenarios. Yet it raises great challenges to perform complicated inference tasks relying on a cluster of IoT devices that are heterogeneous in their computing/communication capacity and prone to crash or timeout failures. In this paper, we present RoCoIn, a robust cooperative inference mechanism for locally distributed execution of deep neural network-based inference tasks over heterogeneous edge devices. It creates a set of independent and compact student models that are learned from a large model using knowledge distillation for distributed deployment. In particular, the devices are strategically grouped to redundantly deploy and execute the same student model such that the inference process is resilient to any local failures, while a joint knowledge partition and student model assignment scheme are designed to minimize the response latency of the distributed inference system in the presence of devices with diverse capacities. Extensive simulations are conducted to corroborate the superior performance of our RoCoIn for distributed inference compared to several baselines, and the results demonstrate its efficacy in timely inference and failure resiliency.

artificial intelligence, edge device, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2406.14185

Country:

Asia > China (0.14)
North America > Canada (0.14)

Genre: Research Report (0.70)

Industry:

Education (1.00)
Information Technology > Smart Houses & Appliances (0.34)

Technology:

Information Technology > Internet of Things (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback