AITopics

1905.00495

Country: North America > Canada (0.28)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.93)

Moushegian, Sean, Banerjee, Taposh, Tarokh, Vahid

Diffusion-Based Hypothesis Testing and Change-Point Detection

arXiv.org Machine LearningJun-23-2025

Score-based methods have recently seen increasing popularity in modeling and generation. Methods have been constructed to perform hypothesis testing and change-point detection with score functions, but these methods are in general not as powerful as their likelihood-based peers. Recent works consider generalizing the score-based Fisher divergence into a diffusion-divergence by transforming score functions via multiplication with a matrix-valued function or a weight matrix. In this paper, we extend the score-based hypothesis test and change-point detection stopping rule into their diffusion-based analogs. Additionally, we theoretically quantify the performance of these diffusion-based algorithms and study scenarios where optimal performance is achievable. We propose a method of numerically optimizing the weight matrix and present numerical simulations to illustrate the advantages of diffusion-based algorithms.

algorithm, artificial intelligence, machine learning, (17 more...)

2506.16089

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > United States > North Carolina > Durham County > Durham (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (0.62)

Zhang, Jiaojiao, Zhu, Linglingzhi, Fay, Dominik, Johansson, Mikael

Locally Differentially Private Online Federated Learning With Correlated Noise

arXiv.org Machine LearningJan-8-2025

We introduce a locally differentially private (LDP) algorithm for online federated learning that employs temporally correlated noise to improve utility while preserving privacy. To address challenges posed by the correlated noise and local updates with streaming non-IID data, we develop a perturbed iterate analysis that controls the impact of the noise on the utility. Moreover, we demonstrate how the drift errors from local updates can be effectively managed for several classes of nonconvex loss functions. Subject to an $(\epsilon,\delta)$-LDP budget, we establish a dynamic regret bound that quantifies the impact of key parameters and the intensity of changes in the dynamic environment on the learning performance. Numerical experiments confirm the efficacy of the proposed algorithm.

algorithm, correlated noise, noise, (17 more...)

2411.18752

Country:

Europe > Sweden > Stockholm > Stockholm (0.04)
Asia > China > Hong Kong (0.04)

Genre:

Research Report > New Finding (0.93)
Instructional Material > Online (0.61)

Industry: Information Technology > Security & Privacy (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

arXiv.org Machine LearningOct-22-2024

Error estimates between SGD with momentum and underdamped Langevin diffusion

Guillin, Arnaud, Wang, Yu, Xu, Lihu, Yang, Haoran

Stochastic gradient descent with momentum is a popular variant of stochastic gradient descent, which has recently been reported to have a close relationship with the underdamped Langevin diffusion. In this paper, we establish a quantitative error estimate between them in the 1-Wasserstein and total variation distances.

artificial intelligence, lemma 3, machine learning, (15 more...)

2410.17297

Country:

Asia > Macao (0.04)
Asia > China > Beijing > Beijing (0.04)
North America > United States > New York (0.04)
(4 more...)

Genre: Research Report (0.63)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.95)

Butler, Brooks A., Leung, Chi Ho, Paré, Philip E.

Collaborative Safety-Critical Formation Control with Obstacle Avoidance

arXiv.org Artificial IntelligenceOct-4-2024

This work explores a collaborative method for ensuring safety in multi-agent formation control problems. We formulate a control barrier function (CBF) based safety filter control law for a generic distributed formation controller and extend our previously developed collaborative safety framework to an obstacle avoidance problem for agents with acceleration control inputs. We then incorporate multi-obstacle collision avoidance into the collaborative safety framework. This framework includes a method for computing the maximum capability of agents to satisfy their individual safety requirements. We analyze the convergence rate of our collaborative safety algorithm, and prove the linear-time convergence of cooperating agents to a jointly feasible safe action for all agents under the special case of a tree-structured communication network with a single obstacle for each agent. We illustrate the analytical results via simulation on a mass-spring kinematics-based formation controller and demonstrate the finite-time convergence of the collaborative safety algorithm in the simple proven case, the more general case of a fully-connected system with multiple static obstacles, and with dynamic obstacles.

agent, artificial intelligence, neighbor, (15 more...)

2410.03885

Country:

North America > United States > California > Orange County > Irvine (0.04)
Asia > Japan (0.04)

Genre: Research Report (0.63)

Industry: Energy (0.67)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.34)

Edin, Adrian, Chen, Zheng, Kieffer, Michel, Johansson, Mikael

Temporal Predictive Coding for Gradient Compression in Distributed Learning

arXiv.org Artificial IntelligenceOct-3-2024

This paper proposes a prediction-based gradient compression method for distributed learning with event-triggered communication. Our goal is to reduce the amount of information transmitted from the distributed agents to the parameter server by exploiting temporal correlation in the local gradients. We use a linear predictor that \textit{combines past gradients to form a prediction of the current gradient}, with coefficients that are optimized by solving a least-square problem. In each iteration, every agent transmits the predictor coefficients to the server such that the predicted local gradient can be computed. The difference between the true local gradient and the predicted one, termed the \textit{prediction residual, is only transmitted when its norm is above some threshold.} When this additional communication step is omitted, the server uses the prediction as the estimated gradient. This proposed design shows notable performance gains compared to existing methods in the literature, achieving convergence with reduced communication costs.

coefficient, gradient, prediction, (16 more...)

2410.02478

Country:

Europe > Sweden > Östergötland County > Linköping (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
Europe > France (0.04)

Genre: Research Report (1.00)

Industry: Law > Litigation (0.41)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.69)
Information Technology > Communications > Networks (0.68)

De Lellis, Francesco, Coraggio, Marco, Russo, Giovanni, Musolesi, Mirco, di Bernardo, Mario

Guaranteeing Control Requirements via Reward Shaping in Reinforcement Learning

arXiv.org Artificial IntelligenceNov-16-2023

In addressing control problems such as regulation and tracking through reinforcement learning, it is often required to guarantee that the acquired policy meets essential performance and stability criteria such as a desired settling time and steady-state error prior to deployment. Motivated by this necessity, we present a set of results and a systematic reward shaping procedure that (i) ensures the optimal policy generates trajectories that align with specified control requirements and (ii) allows to assess whether any given policy satisfies them. We validate our approach through comprehensive numerical experiments conducted in two representative environments from OpenAI Gym: the Inverted Pendulum swing-up problem and the Lunar Lander. Utilizing both tabular and deep reinforcement learning methods, our experiments consistently affirm the efficacy of our proposed framework, highlighting its effectiveness in ensuring policy adherence to the prescribed control requirements.

assumption iv, reinforcement, state space sequence, (12 more...)

2311.10026

Country:

North America > United States > California > Santa Barbara County > Santa Barbara (0.14)
Europe > Italy > Campania > Naples (0.04)
Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.04)
(5 more...)

Genre: Research Report (0.64)

Industry: Education (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Crafts, Evan Scope, Zhang, Xianyang, Zhao, Bo

Bayesian Cram\'er-Rao Bound Estimation with Score-Based Models

arXiv.org Machine LearningSep-27-2023

The Bayesian Cram\'er-Rao bound (CRB) provides a lower bound on the error of any Bayesian estimator under mild regularity conditions. It can be used to benchmark the performance of estimators, and provides a principled design metric for guiding system design and optimization. However, the Bayesian CRB depends on the prior distribution, which is often unknown for many problems of interest. This work develops a new data-driven estimator for the Bayesian CRB using score matching, a statistical estimation technique, to model the prior distribution. The performance of the estimator is analyzed in both the classical parametric modeling regime and the neural network modeling regime. In both settings, we develop novel non-asymptotic bounds on the score matching error and our Bayesian CRB estimator. Our proofs build on results from empirical process theory, including classical bounds and recently introduced techniques for characterizing neural networks, to address the challenges of bounding the score matching error. The performance of the estimator is illustrated empirically on a denoising problem example with a Gaussian mixture prior.

assumption iv, estimator, logp, (15 more...)

2309.16076

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > New York (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Texas > Brazos County > College Station (0.04)

Genre: Research Report (0.82)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)

Yuan, Zhenyuan, Zhu, Minghui

Lightweight Distributed Gaussian Process Regression for Online Machine Learning

arXiv.org Artificial IntelligenceJul-9-2023

In this paper, we study the problem where a group of agents aim to collaboratively learn a common static latent function through streaming data. We propose a lightweight distributed Gaussian process regression (GPR) algorithm that is cognizant of agents' limited capabilities in communication, computation and memory. Each agent independently runs agent-based GPR using local streaming data to predict test points of interest; then the agents collaboratively execute distributed GPR to obtain global predictions over a common sparse set of test points; finally, each agent fuses results from distributed GPR with agent-based GPR to refine its predictions. By quantifying the transient and steady-state performances in predictive variance and error, we show that limited inter-agent communication improves learning performances in the sense of Pareto. Monte Carlo simulation is conducted to evaluate the developed algorithm.

agg, artificial intelligence, machine learning, (18 more...)

2105.04738

Country:

North America > United States > Pennsylvania (0.04)
Oceania > Australia > Queensland > Brisbane (0.04)
North America > United States > Massachusetts (0.04)
(8 more...)

Genre:

Research Report (0.49)
Instructional Material > Online (0.40)

Industry:

Education (0.67)
Energy (0.45)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Artificial IntelligenceJun-29-2023

Secure and Fast Asynchronous Vertical Federated Learning via Cascaded Hybrid Optimization

Wang, Ganyu, Zhang, Qingsong, Xiang, Li, Wang, Boyu, Gu, Bin, Ling, Charles

--V ertical Federated Learning (VFL) attracts increasing attention because it empowers multiple parties to jointly train a privacy-preserving model over vertically partitioned data. Recent research has shown that applying zeroth-order optimization (ZOO) has many advantages in building a practical VFL algorithm. However, a vital problem with the ZOO-based VFL is its slow convergence rate, which limits its application in handling modern large models. T o address this problem, we propose a cascaded hybrid optimization method in VFL. In this method, the downstream models (clients) are trained with ZOO to protect privacy and ensure that no internal information is shared. Meanwhile, the upstream model (server) is updated with first-order optimization (FOO) locally, which significantly improves the convergence rate, making it feasible to train the large models without compromising privacy and security. We theoretically prove that our VFL framework converges faster than the ZOO-based VFL, as the convergence of our framework is not limited by the size of the server model, making it effective for training large models with the major part on the server . Extensive experiments demonstrate that our method achieves faster convergence than the ZOO-based VFL framework, while maintaining an equivalent level of privacy protection. Moreover, we show that the convergence of our VFL is comparable to the unsafe FOO-based VFL baseline. Additionally, we demonstrate that our method makes the training of a large model feasible. Data availability is essential for machine learning, however, privacy concerns often prevent the direct sharing of data among different parties. This approach allows multiple parties to leverage their data while adhering to the privacy protection measure and the government regulation, such as the General Data Protection Regulation (GDPR) [4]. Bin Gu is with Department of machine learning, Mohamed Bin Za-yed University of Artificial Intelligence, Abu Dhabi, UAE (e-mail: jsgu-bin@gmail.com). Charles Ling, Boyu Wang, Xiang Li, Ganyu Wang is with Department of Computer Science of Western University, London, Ontario, Canada. Qingsong Zhang is with School of Electronic Engineering, Xidian University, Xi'an, China (email: qszhang1995@gmail.com).

artificial intelligence, machine learning, tnull null null 2, (15 more...)

2306.16077

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.54)
North America > Canada > Ontario > Middlesex County > London (0.24)
Asia > China > Shaanxi Province > Xi'an (0.24)
(3 more...)

Genre: Research Report > New Finding (0.92)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)