Goto

Collaborating Authors

 cooperative inference




6 Supplementary Materials 6.1 Notation and Definitions Given a set X, we denote the set of probability measures on X

Neural Information Processing Systems

In cooperative inference, Y ang et al. ( 2018) defines a system of Since the teacher's hypothesis marginal and the learner's data marginal are always fixed, our alternating minimization scheme varies conditional probabilities: the hypothesis induced family of Note the other families of conditional probabilities and marginals can be found by Bayes' This is the first of the two equations that define cooperative inference at step one. The neural networks are randomly initialized. The results are listed in Table 1 . In this setting, Y ang et al. ( 2018) shows that the optimal communication plans for the teacher and learner are the same. Translated to our framework, Y ang et al. ( 2018) tells us that Figure 6.1: From left to right (a) model with alternating minimization on matrix In Figure 6.1 (a) and (b), we see that with alternating minimization, the mean of The neural network architectures are typical variational autoencoder architectures.



AI Flow at the Network Edge

Shao, Jiawei, Li, Xuelong

arXiv.org Artificial Intelligence

Recent advancements in large language models (LLMs) and their multimodal variants have led to remarkable progress across various domains, demonstrating impressive capabilities and unprecedented potential. In the era of ubiquitous connectivity, leveraging communication networks to distribute intelligence is a transformative concept, envisioning AI-powered services accessible at the network edge. However, pushing large models from the cloud to resource-constrained environments faces critical challenges. Model inference on low-end devices leads to excessive latency and performance bottlenecks, while raw data transmission over limited bandwidth networks causes high communication overhead. This article presents AI Flow, a framework that streamlines the inference process by jointly leveraging the heterogeneous resources available across devices, edge nodes, and cloud servers, making intelligence flow across networks. To facilitate cooperation among multiple computational nodes, the proposed framework explores a paradigm shift in the design of communication network systems from transmitting information flow to intelligence flow, where the goal of communications is task-oriented and folded into the inference process. Experimental results demonstrate the effectiveness of the proposed framework through an image captioning use case, showcasing the ability to reduce response latency while maintaining high-quality captions. This article serves as a position paper for identifying the motivation, challenges, and principles of AI Flow.


Personalized Deep Learning for Ventricular Arrhythmias Detection on Medical IoT Systems

Jia, Zhenge, Wang, Zhepeng, Hong, Feng, Ping, Lichuan, Shi, Yiyu, Hu, Jingtong

arXiv.org Machine Learning

Life-threatening ventricular arrhythmias (VA) are the leading cause of sudden cardiac death (SCD), which is the most significant cause of natural death in the US. The implantable cardioverter defibrillator (ICD) is a small device implanted to patients under high risk of SCD as a preventive treatment. The ICD continuously monitors the intracardiac rhythm and delivers shock when detecting the life-threatening VA. Traditional methods detect VA by setting criteria on the detected rhythm. However, those methods suffer from a high inappropriate shock rate and require a regular follow-up to optimize criteria parameters for each ICD recipient. To ameliorate the challenges, we propose the personalized computing framework for deep learning based VA detection on medical IoT systems. The system consists of intracardiac and surface rhythm monitors, and the cloud platform for data uploading, diagnosis, and CNN model personalization. We equip the system with real-time inference on both intracardiac and surface rhythm monitors. To improve the detection accuracy, we enable the monitors to detect VA collaboratively by proposing the cooperative inference. We also introduce the CNN personalization for each patient based on the computing framework to tackle the unlabeled and limited rhythm data problem. When compared with the traditional detection algorithm, the proposed method achieves comparable accuracy on VA rhythm detection and 6.6% reduction in inappropriate shock rate, while the average inference latency is kept at 71ms.


Generalizing the theory of cooperative inference

Wang, Pei, Paranamana, Pushpi, Shafto, Patrick

arXiv.org Machine Learning

Cooperation information sharing is important to theories of human learning and has potential implications for machine learning. Prior work derived conditions for achieving optimal Cooperative Inference given strong, relatively restrictive assumptions. We relax these assumptions by demonstrating convergence for any discrete joint distribution, robustness through equivalence classes and stability under perturbation, and effectiveness by deriving bounds from structural properties of the original joint distribution. We provide geometric interpretations, connections to and implications for optimal transport, and connections to importance sampling, and conclude by outlining open questions and challenges to realizing the promise of Cooperative Inference.