AITopics

2605.24364

Country: North America > United States > Pennsylvania (0.28)

Genre:

Research Report (0.81)
Instructional Material (0.65)

Industry: Health & Medicine (0.45)

Technology:

Information Technology > Data Science > Data Mining (0.87)
Information Technology > Modeling & Simulation (0.86)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)
(2 more...)

arXiv.org Artificial IntelligenceNov-25-2025

FastForward Pruning: Efficient LLM Pruning via Single-Step Reinforcement Learning

Yuan, Xin, Li, Siqi, Wei, Jiateng, Zhu, Chengrui, Wu, Yanming, Li, Qingpeng, Lv, Jiajun, Lan, Xiaoke, Chen, Jun, Liu, Yong

Pruning is an effective method for compressing Large Language Models, but finding an optimal, non-uniform layer-wise sparsity allocation remains a key challenge. While heuristic methods are fast but yield suboptimal performance, more powerful search-based approaches like Reinforcement Learning are often hindered by prohibitive computational costs on large-scale models. To overcome this efficiency barrier, we propose FastForward Pruning. Its core is a decoupled, single-step RL framework that separates policy optimization from the complex budget satisfaction problem. Such a decoupling is crucial for efficiently searching the vast policy space of LLMs. This curriculum-based strategy begins with low-cost, simple tasks and gradually increases in complexity, significantly reducing the search's computational overhead. Evaluated on the LLaMA, Mistral, and OPT model families, our framework discovers pruning policies that achieve superior performance over strong heuristic baselines. Crucially, when compared to other search-based algorithms, our method achieves competitive or superior results at a fraction of the computational cost, demonstrating a clear advantage in search efficiency.

arxiv preprint arxiv, large language model, machine learning, (15 more...)

2511.18977

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Vadlamani, Aditya T., Srinivasan, Anutam, Maneriker, Pranav, Payani, Ali, Parthasarathy, Srinivasan

A Generic Framework for Conformal Fairness

arXiv.org Artificial IntelligenceOct-21-2025

Conformal Prediction (CP) is a popular method for uncertainty quantification with machine learning models. While conformal prediction provides probabilistic guarantees regarding the coverage of the true label, these guarantees are agnostic to the presence of sensitive attributes within the dataset. In this work, we formalize \textit{Conformal Fairness}, a notion of fairness using conformal predictors, and provide a theoretically well-founded algorithm and associated framework to control for the gaps in coverage between different sensitive groups. Our framework leverages the exchangeability assumption (implicit to CP) rather than the typical IID assumption, allowing us to apply the notion of Conformal Fairness to data types and tasks that are not IID, such as graph data. Experiments were conducted on graph and tabular datasets to demonstrate that the algorithm can control fairness-related gaps in addition to coverage aligned with theoretical expectations.

artificial intelligence, data mining, machine learning, (16 more...)

2505.16115

Country: North America > United States (0.28)

Genre: Research Report (0.64)

Industry:

Education (0.67)
Government (0.67)
Health & Medicine (0.46)

Technology:

Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Alkhatib, Yahya, Tay, Wee Peng

On Conformal Machine Unlearning

arXiv.org Machine LearningAug-6-2025

The increasing demand for data privacy, driven by regulations such as GDPR and CCPA, has made Machine Unlearning (MU) essential for removing the influence of specific training samples from machine learning models while preserving performance on retained data. However, most existing MU methods lack rigorous statistical guarantees, rely on heuristic metrics, and often require computationally expensive retraining baselines. To overcome these limitations, we introduce a new definition for MU based on Conformal Prediction (CP), providing statistically sound, uncertainty-aware guarantees without the need for the concept of naive retraining. We formalize conformal criteria that quantify how often forgotten samples are excluded from CP sets, and propose empirical metrics,the Efficiently Covered Frequency (ECF at c) and its complement, the Efficiently Uncovered Frequency (EuCF at d), to measure the effectiveness of unlearning. We further present a practical unlearning method designed to optimize these conformal metrics. Extensive experiments across diverse forgetting scenarios, datasets and models demonstrate the efficacy of our approach in removing targeted data.

machine learning, natural language, prediction, (19 more...)

2508.03245

Country: North America > United States > California (0.04)

Genre: Research Report (0.82)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Koike-Akino, Toshiaki, Liu, Jing, Wang, Ye

$μ$-MoE: Test-Time Pruning as Micro-Grained Mixture-of-Experts

arXiv.org Artificial IntelligenceMay-27-2025

To tackle the huge computational demand of large foundation models, activation-aware compression techniques without retraining have been introduced. However, since these rely on calibration data, domain shift may arise for unknown downstream tasks. With a computationally efficient calibration, activation-aware pruning can be executed for every prompt adaptively, yet achieving reduced complexity at inference. We formulate it as a mixture of micro-experts, called $μ$-MoE. Several experiments demonstrate that $μ$-MoE can dynamically adapt to task/prompt-dependent structured sparsity on the fly.

arxiv preprint arxiv, large language model, machine learning, (18 more...)

2505.18451

Country: North America > United States (0.28)

Genre: Research Report (0.82)

Industry: Education > Curriculum > Subject-Specific Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.99)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)

Wang, Jun, Sundarsingh, David Smith, Deshmukh, Jyotirmoy V., Kantaros, Yiannis

ConformalNL2LTL: Translating Natural Language Instructions into Temporal Logic Formulas with Conformal Correctness Guarantees

arXiv.org Artificial IntelligenceMay-1-2025

Linear Temporal Logic (LTL) has become a prevalent specification language for robotic tasks. To mitigate the significant manual effort and expertise required to define LTL-encoded tasks, several methods have been proposed for translating Natural Language (NL) instructions into LTL formulas, which, however, lack correctness guarantees. To address this, we introduce a new NL-to-LTL translation method, called ConformalNL2LTL, that can achieve user-defined translation success rates over unseen NL commands. Our method constructs LTL formulas iteratively by addressing a sequence of open-vocabulary Question-Answering (QA) problems with LLMs. To enable uncertainty-aware translation, we leverage conformal prediction (CP), a distribution-free uncertainty quantification tool for black-box models. CP enables our method to assess the uncertainty in LLM-generated answers, allowing it to proceed with translation when sufficiently confident and request help otherwise. We provide both theoretical and empirical results demonstrating that ConformalNL2LTL achieves user-specified translation accuracy while minimizing help rates.

formula, large language model, natural language, (19 more...)

2504.21022

Country: North America > United States > California > Los Angeles County > Los Angeles (0.28)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.74)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.48)

Machado, Agathe Fernandes, Grondin, Suzie, Ratz, Philipp, Charpentier, Arthur, Hu, François

EquiPy: Sequential Fairness using Optimal Transport in Python

arXiv.org Artificial IntelligenceMar-12-2025

Algorithmic fairness has received considerable attention due to the failures of various predictive AI systems that have been found to be unfairly biased against subgroups of the population. Many approaches have been proposed to mitigate such biases in predictive systems, however, they often struggle to provide accurate estimates and transparent correction mechanisms in the case where multiple sensitive variables, such as a combination of gender and race, are involved. This paper introduces a new open source Python package, EquiPy, which provides a easy-to-use and model agnostic toolbox for efficiently achieving fairness across multiple sensitive variables. It also offers comprehensive graphic utilities to enable the user to interpret the influence of each sensitive variable within a global context. EquiPy makes use of theoretical results that allow the complexity arising from the use of multiple variables to be broken down into easier-to-solve sub-problems. We demonstrate the ease of use for both mitigation and interpretation on publicly available data derived from the US Census and provide sample code for its use.

artificial intelligence, machine learning, prediction, (15 more...)

2503.09866

Country:

North America > Canada > Quebec (0.14)
Europe > France (0.14)

Genre: Research Report (0.82)

Industry: Energy > Oil & Gas > Upstream (0.34)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

arXiv.org Artificial IntelligenceFeb-27-2025

Machine-learning for photoplethysmography analysis: Benchmarking feature, image, and signal-based approaches

Moulaeifard, Mohammad, Coquelin, Loic, Rinkevičius, Mantas, Sološenko, Andrius, Pfeffer, Oskar, Bench, Ciaran, Hegemann, Nando, Vardanega, Sara, Nandi, Manasi, Alastruey, Jordi, Heiss, Christian, Marozas, Vaidotas, Thompson, Andrew, Aston, Philip J., Charlton, Peter H., Strodthoff, Nils

Photoplethysmography (PPG) is a widely used non-invasive physiological sensing technique, suitable for various clinical applications. Such clinical applications are increasingly supported by machine learning methods, raising the question of the most appropriate input representation and model choice. Comprehensive comparisons, in particular across different input representations, are scarce. We address this gap in the research landscape by a comprehensive benchmarking study covering three kinds of input representations, interpretable features, image representations and raw waveforms, across prototypical regression and classification use cases: blood pressure and atrial fibrillation prediction. In both cases, the best results are achieved by deep neural networks operating on raw time series as input representations. Within this model class, best results are achieved by modern convolutional neural networks (CNNs). but depending on the task setup, shallow CNNs are often also very competitive. We envision that these results will be insightful for researchers to guide their choice on machine learning tasks for PPG data, even beyond the use cases presented in this work.

classification, dataset, detection, (14 more...)

2502.19949

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Europe > United Kingdom > England > Surrey > Guildford (0.04)
Europe > Lithuania > Kaunas County > Kaunas (0.04)
(5 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area > Hematology (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Principato, Guillaume, Amara-Ouali, Yvenn, Goude, Yannig, Hamrouche, Bachir, Poggi, Jean-Michel, Stoltz, Gilles

Conformal Prediction for Hierarchical Data

arXiv.org Machine LearningNov-20-2024

Reconciliation has become an essential tool in multivariate point forecasting for hierarchical time series. However, there is still a lack of understanding of the theoretical properties of probabilistic Forecast Reconciliation techniques. Meanwhile, Conformal Prediction is a general framework with growing appeal that provides prediction sets with probabilistic guarantees in finite sample. In this paper, we propose a first step towards combining Conformal Prediction and Forecast Reconciliation by analyzing how including a reconciliation step in the Split Conformal Prediction (SCP) procedure enhances the resulting prediction sets. In particular, we show that the validity granted by SCP remains while improving the efficiency of the prediction sets. We also advocate a variation of the theoretical procedure for practical use. Finally, we illustrate these results with simulations.

conformal prediction, non-conformity score, reconciliation, (14 more...)

2411.13479

Country:

Europe > France (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia (0.04)

Genre: Research Report (0.40)

Industry:

Transportation > Ground > Road (0.67)
Transportation > Electric Vehicle (0.67)
Automobiles & Trucks (0.67)

Technology:

Information Technology > Data Science (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Maneriker, Pranav, Vadlamani, Aditya T., Srinivasan, Anutam, He, Yuntian, Payani, Ali, Parthasarathy, Srinivasan

Benchmarking Graph Conformal Prediction: Empirical Analysis, Scalability, and Theoretical Insights

arXiv.org Machine LearningSep-26-2024

Modern machine learning models trained on losses based on point predictions are prone to be overconfident in their predictions [Guo et al., 2017]. The Conformal Prediction (CP) framework [Vovk et al., 2005] provides a mechanism for generating statistically sound post hoc prediction sets (or intervals, in case of continuous outcomes) with coverage guarantees under mild assumptions. The usual assumption made in CP is that data are exchangeable, i.e., the joint distribution of the data is invariant to permutations of the data points. CP's guarantees are distribution-free and can be added post hoc to arbitrary black-box predictor scores, making them ideal candidates for quantifying uncertainty in complex models, such as neural networks. Network-structured data such as social networks, transportation networks, and biological networks are ubiquitous in modern data science applications.

dataset, efficiency, prediction, (16 more...)

2409.18332

Country:

North America > United States > Ohio (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Nevada > Clark County > Las Vegas (0.04)
North America > United States > California > Los Angeles County > Pasadena (0.04)

Genre: Research Report (1.00)

Industry:

Transportation (0.54)
Information Technology > Services (0.36)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)