AITopics

2503.22943

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
North America > United States (0.04)
Asia > China > Beijing > Beijing (0.04)
(5 more...)

Genre: Overview (1.00)

Industry:

Semiconductors & Electronics (1.00)
Information Technology > Software (0.54)
Transportation > Ground (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Hardware (1.00)
Information Technology > Communications > Networks (1.00)
(6 more...)

arXiv.org Machine LearningApr-2-2025

Client Selection in Federated Learning with Data Heterogeneity and Network Latencies

Vardhan, Harsh, Yu, Xiaofan, Rosing, Tajana, Mazumdar, Arya

Federated learning (FL) is a distributed machine learning paradigm where multiple clients conduct local training based on their private data, then the updated models are sent to a central server for global aggregation. The practical convergence of FL is challenged by multiple factors, with the primary hurdle being the heterogeneity among clients. This heterogeneity manifests as data heterogeneity concerning local data distribution and latency heterogeneity during model transmission to the server. While prior research has introduced various efficient client selection methods to alleviate the negative impacts of either of these heterogeneities individually, efficient methods to handle real-world settings where both these heterogeneities exist simultaneously do not exist. In this paper, we propose two novel theoretically optimal client selection schemes that can handle both these heterogeneities. Our methods involve solving simple optimization problems every round obtained by minimizing the theoretical runtime to convergence. Empirical evaluations on 9 datasets with non-iid data distributions, 2 practical delay distributions, and non-convex neural network models demonstrate that our algorithms are at least competitive to and at most 20 times better than best existing baselines.

artificial intelligence, heterogeneity, machine learning, (15 more...)

arXiv.org Machine Learning

2504.01921

Country:

North America > United States > New York (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report (0.82)

Industry: Information Technology (0.87)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)

Patel, Oam, Wang, Jason, Nayak, Nikhil Shivakumar, Srinivas, Suraj, Lakkaraju, Himabindu

Towards Interpretable Soft Prompts

arXiv.org Machine LearningApr-2-2025

Soft prompts have been popularized as a cheap and easy way to improve task-specific LLM performance beyond few-shot prompts. Despite their origin as an automated prompting method, however, soft prompts and other trainable prompts remain a black-box method with no immediately interpretable connections to prompting. We create a novel theoretical framework for evaluating the interpretability of trainable prompts based on two desiderata: faithfulness and scrutability. We find that existing methods do not naturally satisfy our proposed interpretability criterion. Instead, our framework inspires a new direction of trainable prompting methods that explicitly optimizes for interpretability. To this end, we formulate and test new interpretability-oriented objective functions for two state-of-the-art prompt tuners: Hard Prompts Made Easy (PEZ) and RLPrompt. Our experiments with GPT-2 demonstrate a fundamental trade-off between interpretability and the task-performance of the trainable prompt, explicating the hardness of the soft prompt interpretability problem and revealing odd behavior that arises when one optimizes for an interpretability proxy.

large language model, machine learning, natural language, (16 more...)

arXiv.org Machine Learning

2504.02144

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.71)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.69)

Hamakawa, Yohei, Kashimata, Tomoya, Yamasaki, Masaya, Tatsumura, Kosuke

Machine Learning-assisted High-speed Combinatorial Optimization with Ising Machines for Dynamically Changing Problems

arXiv.org Artificial IntelligenceApr-2-2025

Quantum or quantum-inspired Ising machines have recently shown promise in solving combinatorial optimization problems in a short time. Real-world applications, such as time division multiple access (TDMA) scheduling for wireless multi-hop networks and financial trading, require solving those problems sequentially where the size and characteristics change dynamically. However, using Ising machines involves challenges to shorten system-wide latency due to the transfer of large Ising model or the cloud access and to determine the parameters for each problem. Here we show a combinatorial optimization method using embedded Ising machines, which enables solving diverse problems at high speed without runtime parameter tuning. We customize the algorithm and circuit architecture of the simulated bifurcation-based Ising machine to compress the Ising model and accelerate computation and then built a machine learning model to estimate appropriate parameters using extensive training data. In TDMA scheduling for wireless multi-hop networks, our demonstration has shown that the sophisticated system can adapt to changes in the problem and showed that it has a speed advantage over conventional methods.

artificial intelligence, machine learning, optimization problem, (14 more...)

2503.23966

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > New Mexico > Los Alamos County > Los Alamos (0.04)
Europe > Hungary > Hajdú-Bihar County > Debrecen (0.04)
Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Banking & Finance > Trading (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Chauhan, Dikshit, Shivani, null, Jung, Donghwi, Yadav, Anupam

Advancements in Multimodal Differential Evolution: A Comprehensive Review and Future Perspectives

Multi-modal optimization involves identifying multiple global and local optima of a function, offering valuable insights into diverse optimal solutions within the search space. Evolutionary algorithms (EAs) excel at finding multiple solutions in a single run, providing a distinct advantage over classical optimization techniques that often require multiple restarts without guarantee of obtaining diverse solutions. Among these EAs, differential evolution (DE) stands out as a powerful and versatile optimizer for continuous parameter spaces. DE has shown significant success in multi-modal optimization by utilizing its population-based search to promote the formation of multiple stable subpopulations, each targeting different optima. Recent advancements in DE for multi-modal optimization have focused on niching methods, parameter adaptation, hybridization with other algorithms including machine learning, and applications across various domains. Given these developments, it is an opportune moment to present a critical review of the latest literature and identify key future research directions. This paper offers a comprehensive overview of recent DE advancements in multimodal optimization, including methods for handling multiple optima, hybridization with EAs, and machine learning, and highlights a range of real-world applications. Additionally, the paper outlines a set of compelling open problems and future research issues from multiple perspectives

artificial intelligence, evolutionary algorithm, machine learning, (15 more...)

2504.00717

Country:

Asia > Singapore (0.04)
Oceania > Australia (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(6 more...)

Genre:

Research Report > Promising Solution (1.00)
Overview (1.00)

Industry:

Energy (1.00)
Information Technology > Security & Privacy (0.92)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.67)
Health & Medicine > Diagnostic Medicine (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
(3 more...)

Neural Approaches to SAT Solving: Design Choices and Interpretability

Mojžíšek, David, Hůla, Jan, Li, Ziwei, Zhou, Ziyu, Janota, Mikoláš

Reasoning is a cognitive ability which allows humans to solve problems with previously unseen combinations of constraints. For a long time, it has been debated whether artificial neural networks can obtain such generalization skills or whether they can only learn to detect superficial patterns Fodor and Pylyshyn [1988], Marcus [2003, 2018] without being able to generalize to novel combinations of constraints. With the arrival of Large Language Models (LLMs) specially trained for reasoning Guo et al. [2025], Jaech et al. [2024], it became harder and harder to claim that these models can only detect superficial patterns. Nevertheless, the exact mechanism by which they are able to solve tasks that typically require reasoning is largely unknown and the robustness of the solving process is also not understood. In this contribution, we focus on a restricted class of problems that require reasoning, concretely on solving Boolean formulas in CNF form. This could be viewed as a prototypical task where the goal is to solve problems with novel combinations of constraints, and where detecting superficial patterns seen during training would be insufficient. It has already been demonstrated that Graph Neural Networks (GNNs) can successfully learn to solve such problems and generalize to larger problems Selsam et al. [2018], even though they are still not competitive when compared to state of the art SAT solvers. Understanding the underlying mechanisms GNNs employ to successfully solve problems, as well as their limitations, would offer significant practical and theoretical value.

artificial intelligence, machine learning, natural language, (22 more...)

2504.01173

Country:

Europe > Czechia > Moravian-Silesian Region > Ostrava (0.04)
Europe > Slovenia > Drava > Municipality of Benedikt > Benedikt (0.04)
Europe > Portugal > Lisbon > Lisbon (0.04)
Europe > Czechia > Prague (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(3 more...)

Gallego-Posada, Jose, Ramirez, Juan, Hashemizadeh, Meraj, Lacoste-Julien, Simon

Cooper: A Library for Constrained Optimization in Deep Learning

Cooper is an open-source package for solving constrained optimization problems involving deep learning models. Cooper implements several Lagrangian-based first-order update schemes, making it easy to combine constrained optimization algorithms with high-level features of PyTorch such as automatic differentiation, and specialized deep learning architectures and optimizers. Although Cooper is specifically designed for deep learning applications where gradients are estimated based on mini-batches, it is suitable for general non-convex continuous constrained optimization. Cooper's source code is available at https://github.com/cooper-org/cooper.

artificial intelligence, cooper, machine learning, (14 more...)

2504.01212

Country:

North America > Canada > Quebec (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.84)

Industry: Law (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A Survey on Unlearnable Data

Li, Jiahao, Chen, Yiqiang, Xing, Yunbing, Gu, Yang, Lan, Xiangyuan

Unlearnable data (ULD) has emerged as an innovative defense technique to prevent machine learning models from learning meaningful patterns from specific data, thus protecting data privacy and security. By introducing perturbations to the training data, ULD degrades model performance, making it difficult for unauthorized models to extract useful representations. Despite the growing significance of ULD, existing surveys predominantly focus on related fields, such as adversarial attacks and machine unlearning, with little attention given to ULD as an independent area of study. This survey fills that gap by offering a comprehensive review of ULD, examining unlearnable data generation methods, public benchmarks, evaluation metrics, theoretical foundations and practical applications. We compare and contrast different ULD approaches, analyzing their strengths, limitations, and trade-offs related to unlearnability, imperceptibility, efficiency and robustness. Moreover, we discuss key challenges, such as balancing perturbation imperceptibility with model degradation and the computational complexity of ULD generation. Finally, we highlight promising future research directions to advance the effectiveness and applicability of ULD, underscoring its potential to become a crucial tool in the evolving landscape of data protection in machine learning.

large language model, machine learning, natural language, (17 more...)

2503.23536

Country:

North America > United States > California (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
Asia > China > Beijing > Beijing (0.04)
(5 more...)

Genre:

Research Report > Promising Solution (0.45)
Overview > Growing Problem (0.34)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Franks, Ryan, Miroshnikov, Alexey

Explainable post-training bias mitigation with distribution-based fairness metrics

Machine learning (ML) techniques have become ubiquitous in the financial industry due to their powerful predictive performance. However, ML model outputs may lead to certain types of unintended bias, which are measures of unfairness that impact protected sub-populations. Predictive models, and strategies that rely on such models, are subject to laws and regulations that ensure fairness. For instance, financial institutions (FIs) in the U.S. that are in the business of extending credit to applicants are subject to the Equal Credit Opportunity Act (ECOA) [14] and the Fair Housing Act (FHA) [13], which prohibit discrimination in credit offerings and housing transactions. The protected classes identified in the laws, including race, gender, age (subject to very limited exceptions), ethnicity, national origin, and material status, cannot be used as attributes in lending decisions.

artificial intelligence, data mining, machine learning, (18 more...)

2504.01223

Country:

North America > United States > New York > New York County > New York City (0.14)
Europe > Hungary > Budapest > Budapest (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Banking & Finance (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.93)

Egocentric Conformal Prediction for Safe and Efficient Navigation in Dynamic Cluttered Environments

Shin, Jaeuk, Lee, Jungjin, Yang, Insoon

Since safe control of ego-vehicles depends on accurately predicting the future states of surrounding dynamic agents, numerous motion forecasting models [1, 2] have been developed to forecast an agent's future motions from historical data. Nevertheless, these predictions remain inherently prone to error, primarily because they lack information about hidden contexts or intents--such as agents' goals, velocity preferences, or even social relationships among human agents. To address these limitations, conformal prediction (CP) [3, 4] has been employed to reliably assess the models' predictive capabilities. The method offers a principled yet straightforward procedure for calibrating the models. At test time, the calibration results can be used to construct a confidence set that contains the true future states of the environment, assuming that the test and calibration data are exchangeable (i.e., their joint distribution is symmetric). Consequently, CP has been successfully applied to a variety of problems, including reinforcement learning [5, 6], linear This work was supported in part by the Information and Communications Technology Planning and Evaluation (IITP) grants funded by MSIT No. 2022-0-00124, No. 2022-0-00480 and No. RS-2021-II211343, Artificial Intelligence Graduate School Program (Seoul National University). The authors are with the Department of Electrical and Computer Engineering, ASRI, Seoul National University, Seoul 08826, South Korea,{sju5379, jungbbal, insoonyang }@snu.ac.kr arXiv:2504.00447v1

artificial intelligence, machine learning, score function, (19 more...)

2504.00447

Country:

Asia > South Korea > Seoul > Seoul (0.64)
North America > United States > Michigan (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.82)

Industry:

Energy (0.35)
Education > Educational Setting (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots (0.95)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.66)