AITopics

The transfer-based black-box adversarial attack setting poses the challenge of crafting an adversarial example (AE) on known surrogate models that remain effective against unseen target models. Due to the practical importance of this task, numerous methods have been proposed to address this challenge. However, most previous methods are heuristically designed and intuitively justified, lacking a theoretical foundation. To bridge this gap, we derive a novel transferability bound that offers provable guarantees for adversarial transferability. Our theoretical analysis has the advantages of \textit{(i)} deepening our understanding of previous methods by building a general attack framework and \textit{(ii)} providing guidance for designing an effective attack algorithm. Our theoretical results demonstrate that optimizing AEs toward flat minima over the surrogate model set, while controlling the surrogate-target model shift measured by the adversarial model discrepancy, yields a comprehensive guarantee for AE transferability. The results further lead to a general transfer-based attack framework, within which we observe that previous methods consider only partial factors contributing to the transferability. Algorithmically, inspired by our theoretical results, we first elaborately construct the surrogate model set in which models exhibit diverse adversarial vulnerabilities with respect to AEs to narrow an instantiated adversarial model discrepancy. Then, a \textit{model-Diversity-compatible Reverse Adversarial Perturbation} (DRAP) is generated to effectively promote the flatness of AEs over diverse surrogate models to improve transferability. Extensive experiments on NIPS2017 and CIFAR-10 datasets against various target models demonstrate the effectiveness of our proposed attack.

artificial intelligence, machine learning, optimization problem, (19 more...)

2504.16474

Country: Asia > China (0.68)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
(2 more...)

QAOA-GPT: Efficient Generation of Adaptive and Regular Quantum Approximate Optimization Algorithm Circuits

Tyagin, Ilya, Farag, Marwa H., Sherbert, Kyle, Shirali, Karunya, Alexeev, Yuri, Safro, Ilya

--Quantum computing has the potential to improve our ability to solve certain optimization problems that are computationally difficult for classical computers, by offering new algorithmic approaches that may provide speedups under specific conditions. In this work, we introduce QAOA-GPT, a generative framework that leverages Generative Pretrained Transformers (GPT) to directly synthesize quantum circuits for solving quadratic unconstrained binary optimization problems, and demonstrate it on the MaxCut problem on graphs. T o diversify the training circuits and ensure their quality, we have generated a synthetic dataset using the adaptive QAOA approach, a method that incrementally builds and optimizes problem-specific circuits. The experiments conducted on a curated set of graph instances demonstrate that QAOA-GPT, generates high quality quantum circuits for new problem instances unseen in the training as well as successfully parametrizes QAOA. Our results show that using QAOA-GPT to generate quantum circuits will significantly decrease both the computational overhead of classical QAOA and adaptive approaches that often use gradient evaluation to generate the circuit and the classical optimization of the circuit parameters. Our work shows that generative AI could be a promising avenue to generate compact quantum circuits in a scalable way. Quantum computing is rapidly emerging technology with significant potential across various domains, including finance [1], chemical simulations [2], material science [3], combinatorial optimization [4], and machine learning [5], among others. V ariational quantum-classical algorithms represent one of the most promising classes of quantum algorithms in different domains, showing potential for both fault-tolerant quantum computers and near-term noisy intermediate-scale quantum (NISQ) devices. The Quantum Approximate Optimization Algorithm (QAOA) [6] and many of its subsequent versions and customizations [7] belong to this class and demonstrate great potential due to their problem/application flexibility and compatibility with various quantum architectures. The original QAOA framework employs a fixed ansatz structure, which can limit expressibility and hinder performance, particularly on near-term quantum devices where circuit depth is limited. This rigid design may not capture the problem-specific features needed for efficient optimization. Such methods as ADAPT -QAOA [8] address this challenge by iteratively constructing the ansatz in a problem-informed manner. At each step, ADAPT -QAOA selects operators from a predefined pool based on their gradient with respect to the cost function, incorporating only those that contribute most significantly to improving the objective.

large language model, machine learning, natural language, (18 more...)

2504.1635

Country: North America > United States (1.00)

Genre: Research Report > New Finding (0.86)

Industry:

Energy (0.68)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Learning Explainable Dense Reward Shapes via Bayesian Optimization

Koo, Ryan, Yang, Ian, Raheja, Vipul, Hong, Mingyi, Jun, Kwang-Sung, Kang, Dongyeop

Current reinforcement learning from human feedback (RLHF) pipelines for large language model (LLM) alignment typically assign scalar rewards to sequences, using the final token as a surrogate indicator for the quality of the entire sequence. However, this leads to sparse feedback and suboptimal token-level credit assignment. In this work, we frame reward shaping as an optimization problem focused on token-level credit assignment. We propose a reward-shaping function leveraging explainability methods such as SHAP and LIME to estimate per-token rewards from the reward model. To learn parameters of this shaping function, we employ a bilevel optimization framework that integrates Bayesian Optimization and policy training to handle noise from the token reward estimates. Our experiments show that achieving a better balance of token-level reward attribution leads to performance improvements over baselines on downstream tasks and finds an optimal policy faster during training. Furthermore, we show theoretically that explainability methods that are feature additive attribution functions maintain the optimal policy as the original reward.

large language model, machine learning, natural language, (19 more...)

2504.16272

Country: North America > United States (0.94)

Genre: Research Report (0.85)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.68)
Health & Medicine > Therapeutic Area > Neurology (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Two-Timescale Joint Transmit and Pinching Beamforming for Pinching-Antenna Systems

Zhang, Luyuan, Mu, Xidong, Liu, An, Liu, Yuanwei

--Pinching antenna systems (PASS) have been proposed as a revolutionary flexible antenna technology which facilitates line-of-sight links via numerous low-cost pinching antennas with adjustable activation positions over waveguides. This letter proposes a two-timescale joint transmit and pinching beamforming design for the maximization of sum rate of a PASS-based downlink multi-user multiple input single output system. A primal dual decomposition method is developed to decouple the two-timescale problem into two sub-problems: 1) A Karush-Kuhn-T ucker-guided dual learning-based approach is proposed to solve the short-term transmit beamforming design sub-problem; 2) The long-term pinching beamforming design sub-problem is tackled by adopting a stochastic successive convex approximation method. Simulation results demonstrate that the proposed two-timescale algorithm achieves a significant performance gain compared to other baselines. Flexible-antenna techniques such as reconfigurable intelligent surfaces (RISs) [1], movable antennas [2], and fluid antennas [3], have been developed to break the limitation of fixed-channel assumptions in the sixth generation (6G) and beyond wireless network.

artificial intelligence, machine learning, waveguide, (15 more...)

2504.16099

Country: Asia > China (0.28)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.89)

Bakaryan, Tigran, Aoun, Christoph, Ribeiro, Ricardo de Lima, Hovakimyan, Naira, Gomes, Diogo

Hessian Riemannian Flow For Multi-Population Wardrop Equilibrium

Abstract-- In this paper, we address the problem of optimizing flows on generalized graphs that feature multiple entry points and multiple populations, each with varying co st structures. We tackle this problem by considering the multi - population Wardrop equilibrium, defined through variation al inequalities. We rigorously analyze the existence and uniq ueness of the Wardrop equilibrium. Furthermore, we introduce an efficient numerical method to find the solution. In particula r, we reformulate the equilibrium problem as a distributed optimization problem over subgraphs and introduce a novel Hessian Riemannian flow method--a Riemannian-manifold-projected Hessian flow--to efficiently compute a solution. Fi - nally, we demonstrate the effectiveness of our approach thr ough examples in urban traffic management, including routing for diverse vehicle types and strategies for minimizing emissi ons in congested environments. In traffic management, each driver--whether operating a car, SUV, or truck--selects the route they perceive to be the shortest.

artificial intelligence, equilibrium, optimization problem, (15 more...)

2504.16028

Country:

North America > United States > Illinois (0.14)
Asia > Middle East > Saudi Arabia (0.14)

Genre: Research Report (1.00)

Industry:

Transportation (0.55)
Energy (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.88)

Imgrund, Erik, Eisenhofer, Thorsten, Rieck, Konrad

Adversarial Observations in Weather Forecasting

AI-based systems, such as Google's GenCast, have recently redefined the state of the art in weather forecasting, offering more accurate and timely predictions of both everyday weather and extreme events. While these systems are on the verge of replacing traditional meteorological methods, they also introduce new vulnerabilities into the forecasting process. In this paper, we investigate this threat and present a novel attack on autoregressive diffusion models, such as those used in GenCast, capable of manipulating weather forecasts and fabricating extreme events, including hurricanes, heat waves, and intense rainfall. The attack introduces subtle perturbations into weather observations that are statistically indistinguishable from natural noise and change less than 0.1% of the measurements - comparable to tampering with data from a single meteorological satellite. As modern forecasting integrates data from nearly a hundred satellites and many other sources operated by different countries, our findings highlight a critical security risk with the potential to cause large-scale disruptions and undermine public trust in weather prediction.

adversarial observation, artificial intelligence, machine learning, (16 more...)

2504.15942

Country:

North America > United States (1.00)
Europe (1.00)
Asia (1.00)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Energy > Renewable (1.00)
Government > Military (0.68)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Solving Multi-Agent Safe Optimal Control with Distributed Epigraph Form MARL

Zhang, Songyuan, So, Oswin, Black, Mitchell, Serlin, Zachary, Fan, Chuchu

Tasks for multi-robot systems often require the robots to collaborate and complete a team goal while maintaining safety. This problem is usually formalized as a constrained Markov decision process (CMDP), which targets minimizing a global cost and bringing the mean of constraint violation below a user-defined threshold. Inspired by real-world robotic applications, we define safety as zero constraint violation. While many safe multi-agent reinforcement learning (MARL) algorithms have been proposed to solve CMDPs, these algorithms suffer from unstable training in this setting. To tackle this, we use the epigraph form for constrained optimization to improve training stability and prove that the centralized epigraph form problem can be solved in a distributed fashion by each agent. This results in a novel centralized training distributed execution MARL algorithm named Def-MARL. Simulation experiments on 8 different tasks across 2 different simulators show that Def-MARL achieves the best overall performance, satisfies safety constraints, and maintains stable training. Real-world hardware experiments on Crazyflie quadcopters demonstrate the ability of Def-MARL to safely coordinate agents to complete complex collaborative tasks compared to other methods.

artificial intelligence, machine learning, reinforcement learning, (15 more...)

2504.15425

Country:

North America > United States > Massachusetts > Middlesex County (0.28)
North America > United States > California (0.28)

Genre: Research Report > New Finding (0.92)

Industry:

Transportation (0.93)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.46)

Post-Convergence Sim-to-Real Policy Transfer: A Principled Alternative to Cherry-Picking

Khor, Dylan, Weng, Bowen

Post-Convergence Sim-to-Real Policy Transfer: A Principled Alternative to Cherry-Picking Dylan Khor 1 and Bowen Weng 1 Abstract -- Learning-based approaches, particularly reinforcement learning (RL), have become widely used for developing control policies for autonomous agents, such as locomotion policies for legged robots. Starting from a randomly initialized policy, the empirical expected reward follows a trajectory with an overall increasing trend. While some policies become temporarily stuck in local optima, a well-defined training process generally converges to a reward level with noisy oscillations. However, selecting a policy for real-world deployment is rarely an analytical decision (i.e., simply choosing the one with the highest reward) and is instead often performed through trial and error . T o improve sim-to-real transfer, most research focuses on the pre-convergence stage, employing techniques such as domain randomization, multi-fidelity training, adversarial training, and architectural innovations. However, these methods do not eliminate the inevitable convergence trajectory and noisy oscillations of rewards, leading to heuristic policy selection or cherry-picking. This paper addresses the post-convergence sim-to-real transfer problem by introducing a worst-case performance transference optimization approach, formulated as a convex quadratic-constrained linear programming problem. Extensive experiments demonstrate its effectiveness in transferring RL-based locomotion policies from simulation to real-world laboratory tests. I. INTRODUCTION Figure 1 (b) illustrates the average reward trajectory from training a locomotion policy for the Unitree G1 humanoid robot in Isaac Gym using reinforcement learning (RL) [1] with the random seed being 50. Initially, the randomly initialized policy yields a low training reward.

artificial intelligence, machine learning, real-world performance, (19 more...)

2504.15414

Country:

North America > United States (0.46)
Europe (0.28)

Genre: Research Report (1.00)

Industry: Transportation (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

arXiv.org Machine LearningApr-22-2025

A Geometric Approach to Problems in Optimization and Data Science

Manoj, Naren Sarayu

We give new results for problems in computational and statistical machine learning using tools from high-dimensional geometry and probability. We break up our treatment into two parts. In Part I, we focus on computational considerations in optimization. Specifically, we give new algorithms for approximating convex polytopes in a stream, sparsification and robust least squares regression, and dueling optimization. In Part II, we give new statistical guarantees for data science problems. In particular, we formulate a new model in which we analyze statistical properties of backdoor data poisoning attacks, and we study the robustness of graph clustering algorithms to ``helpful'' misspecification.

artificial intelligence, data mining, machine learning, (21 more...)

arXiv.org Machine Learning

2504.1627

Country:

North America > United States > California > San Francisco County > San Francisco (0.13)
North America > United States > California > Los Angeles County > Los Angeles (0.13)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(19 more...)

Genre:

Workflow (1.00)
Research Report > New Finding (1.00)

Industry:

Leisure & Entertainment (1.00)
Information Technology > Security & Privacy (0.46)
Education > Educational Setting (0.45)
Media > Music (0.45)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.67)
(2 more...)

Wertz, Samuel, Vandaele, Arnaud, Gillis, Nicolas

Efficient algorithms for the Hadamard decomposition

arXiv.org Machine LearningApr-22-2025

The Hadamard decomposition is a powerful technique for data analysis and matrix compression, which decomposes a given matrix into the element-wise product of two or more low-rank matrices. In this paper, we develop an efficient algorithm to solve this problem, leveraging an alternating optimization approach that decomposes the global non-convex problem into a series of convex sub-problems. To improve performance, we explore advanced initialization strategies inspired by the singular value decomposition (SVD) and incorporate acceleration techniques by introducing momentum-based updates. Beyond optimizing the two-matrix case, we also extend the Hadamard decomposition framework to support more than two low-rank matrices, enabling approximations with higher effective ranks while preserving computational efficiency. Finally, we conduct extensive experiments to compare our method with the existing gradient descent-based approaches for the Hadamard decomposition and with traditional low-rank approximation techniques. The results highlight the effectiveness of our proposed method across diverse datasets.

artificial intelligence, machine learning, optimization problem, (18 more...)

arXiv.org Machine Learning

2504.13633

Country: Europe > Belgium (0.04)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.35)