Serving Graph Neural Networks With Distributed Fog Servers For Smart IoT Services
Zeng, Liekang, Chen, Xu, Huang, Peng, Luo, Ke, Zhang, Xiaoxi, Zhou, Zhi
–arXiv.org Artificial Intelligence
Graph Neural Networks (GNNs) have gained growing interest in miscellaneous applications owing to their outstanding ability in extracting latent representation on graph structures. To render GNN-based service for IoT-driven smart applications, traditional model serving paradigms usually resort to the cloud by fully uploading geo-distributed input data to remote datacenters. However, our empirical measurements reveal the significant communication overhead of such cloud-based serving and highlight the profound potential in applying the emerging fog computing. To maximize the architectural benefits brought by fog computing, in this paper, we present Fograph, a novel distributed real-time GNN inference framework that leverages diverse and dynamic resources of multiple fog nodes in proximity to IoT data sources. By introducing heterogeneity-aware execution planning and GNN-specific compression techniques, Fograph tailors its design to well accommodate the unique characteristics of GNN serving in fog environments. Prototype-based evaluation and case study demonstrate that Fograph significantly outperforms the state-of-the-art cloud serving and fog deployment by up to 5.39x execution speedup and 6.84x throughput improvement.
arXiv.org Artificial Intelligence
Jul-4-2023
- Country:
- Asia > China
- Guangdong Province > Guangzhou (0.04)
- Europe > Spain
- North America > United States
- California > San Francisco County
- San Francisco (0.04)
- Illinois > Cook County
- Chicago (0.04)
- California > San Francisco County
- Pacific Ocean > North Pacific Ocean
- San Francisco Bay (0.04)
- Asia > China
- Genre:
- Research Report (1.00)
- Industry:
- Energy > Power Industry (0.46)
- Information Technology (1.00)
- Technology:
- Information Technology
- Architecture (1.00)
- Artificial Intelligence
- Cloud Computing (1.00)
- Communications > Networks (1.00)
- Data Science (1.00)
- Information Technology