Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies
–arXiv.org Artificial Intelligence
The emergence of 5G and edge computing hardware has brought about a significant shift in artificial intelligence, with edge AI becoming a crucial technology for enabling intelligent applications. With the growing amount of data generated and stored on edge devices, deploying AI models for local processing and inference has become increasingly necessary. However, deploying state-of-the-art AI models on resource-constrained edge devices faces significant challenges that must be addressed. This paper presents an optimization triad for efficient and reliable edge AI deployment, including data, model, and system optimization. First, we discuss optimizing data through data cleaning, compression, and augmentation to make it more suitable for edge deployment. Second, we explore model design and compression methods at the model level, such as pruning, quantization, and knowledge distillation. Finally, we introduce system optimization techniques like framework support and hardware acceleration to accelerate edge AI workflows. Based on an in-depth analysis of various application scenarios and deployment challenges of edge AI, this paper proposes an optimization paradigm based on the data-model-system triad to enable a whole set of solutions to effectively transfer ML models, which are initially trained in the cloud, to various edge devices for supporting multiple scenarios.
arXiv.org Artificial Intelligence
Jan-4-2025
- Genre:
- Research Report > Promising Solution (1.00)
- Overview (1.00)
- Industry:
- Health & Medicine > Therapeutic Area (1.00)
- Energy > Power Industry (0.92)
- Telecommunications (0.92)
- Information Technology
- Security & Privacy (1.00)
- Software (0.68)
- Services (0.67)
- Technology:
- Information Technology
- Sensing and Signal Processing > Image Processing (1.00)
- Communications > Networks (1.00)
- Architecture > Real Time Systems (1.00)
- Data Science
- Data Quality (1.00)
- Data Mining (1.00)
- Artificial Intelligence
- Vision (1.00)
- Robots (1.00)
- Representation & Reasoning > Optimization (1.00)
- Natural Language (1.00)
- Cognitive Science (1.00)
- Applied AI (1.00)
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Information Technology