Overcoming Black-box Attack Inefficiency with Hybrid and Dynamic Select Algorithms

Belde, Abhinay Shankar, Ramkumar, Rohit, Rusert, Jonathan

arXiv.org Artificial Intelligence

Adversarial text attack research plays a crucial role in evaluating the robustness of NLP models. However, the increasing complexity of transformer-based architectures has dramatically raised the computational cost of attack testing, especially for researchers with limited resources (e.g., GPUs). Existing popular black-box attack methods often require a large number of queries, which can make them inefficient and impractical for researchers. To address these challenges, we propose two new attack selection strategies called Hybrid and Dynamic Select, which better combine the strengths of previous selection algorithms. Hybrid Select merges generalized BinarySelect techniques with GreedySelect by introducing a size threshold to decide which selection algorithm to use. Dynamic Select provides an alternative approach to combining the generalized Binary and GreedySelect by learning which lengths of texts each selection method should be applied to. This greatly reduces the number of queries needed while maintaining attack effectiveness (a limitation of BinarySelect). Across 4 datasets and 6 target models, our best method (sentence-level Hybrid Select) is able to reduce the number of required queries per attack by up to 25.82% on average against both encoder models and LLMs, without losing the effectiveness of the attack.
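The size-threshold dispatch described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the `greedy_select`/`binary_select` bodies and the `importance` oracles stand in for real model queries, and the threshold value is an assumed placeholder.

```python
# Sketch of a threshold-based hybrid selection in the spirit of Hybrid Select:
# short texts are scanned greedily (one query per token), long texts are
# narrowed down by binary splitting (one comparison per halving level).

def greedy_select(tokens, token_importance):
    """Query every token once; return the index of the most important one."""
    return max(range(len(tokens)), key=token_importance)

def binary_select(tokens, span_importance, lo=0, hi=None):
    """Recursively halve the text, keeping the half the model rates higher."""
    if hi is None:
        hi = len(tokens)
    if hi - lo <= 1:
        return lo
    mid = (lo + hi) // 2
    if span_importance((lo, mid)) >= span_importance((mid, hi)):
        return binary_select(tokens, span_importance, lo, mid)
    return binary_select(tokens, span_importance, mid, hi)

def hybrid_select(tokens, token_importance, span_importance, threshold=32):
    """Dispatch on text length, as Hybrid Select's size threshold does."""
    if len(tokens) <= threshold:
        return greedy_select(tokens, token_importance)
    return binary_select(tokens, span_importance)
```

Greedy scanning costs one query per token, while binary narrowing costs roughly one comparison per halving, which is why a length threshold can trade the two off.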


Learning to Detect Label Errors by Making Them: A Method for Segmentation and Object Detection Datasets

Penquitt, Sarina, Riedlinger, Tobias, Heller, Timo, Reischl, Markus, Rottmann, Matthias

arXiv.org Artificial Intelligence

Recently, detection of label errors and improvement of label quality in datasets for supervised learning tasks have become increasingly important goals in both research and industry. The consequences of incorrectly annotated data include reduced model performance, biased benchmark results, and lower overall accuracy. Current state-of-the-art label error detection methods often focus on a single computer vision task and, consequently, a specific type of dataset, containing, for example, either bounding boxes or pixel-wise annotations. Furthermore, previous methods are not learning-based. In this work, we overcome this research gap. We present a unified method for detecting label errors in object detection, semantic segmentation, and instance segmentation datasets. In a nutshell, our approach - learning to detect label errors by making them - works as follows: we inject different kinds of label errors into the ground truth. Then, the detection of label errors, across all mentioned primary tasks, is framed as an instance segmentation problem based on a composite input. In our experiments, we compare the label error detection performance of our method with various baselines and state-of-the-art approaches of each task's domain on simulated label errors across multiple tasks, datasets, and base models. This is complemented by a generalization study on real-world label errors. Additionally, we release 459 real label errors identified in the Cityscapes dataset and provide a benchmark for real label error detection in Cityscapes. Deep learning thrives on data: the more complex the task, the more data is required. In computer vision, larger training datasets consistently improve model performance [1], driving demand for large-scale, high-quality annotations.
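The error-injection step above can be sketched for the bounding-box case. The error taxonomy below (drop / shift / class flip), the probability, and the perturbation magnitudes are illustrative assumptions, not the paper's actual injection scheme:

```python
import random

# Minimal sketch of "making label errors": perturb clean bounding-box
# annotations so that a label-error detector can be trained on known
# injections (the error flags serve as supervision).

def inject_label_errors(boxes, n_classes, p_error=0.3, rng=None):
    """boxes: list of (x, y, w, h, class_id). Returns (noisy_boxes, error_flags),
    where noisy_boxes may contain None for dropped annotations."""
    rng = rng or random.Random(0)
    noisy, flags = [], []
    for (x, y, w, h, c) in boxes:
        if rng.random() >= p_error:
            noisy.append((x, y, w, h, c))
            flags.append(False)
            continue
        kind = rng.choice(["drop", "shift", "flip"])
        if kind == "drop":          # missing annotation: box removed entirely
            noisy.append(None)
        elif kind == "shift":       # spatially inaccurate box
            dx, dy = rng.uniform(-0.2, 0.2) * w, rng.uniform(-0.2, 0.2) * h
            noisy.append((x + dx, y + dy, w, h, c))
        else:                       # wrong class label
            noisy.append((x, y, w, h, (c + rng.randrange(1, n_classes)) % n_classes))
        flags.append(True)
    return noisy, flags
```

Because the flags record exactly which annotations were corrupted, the downstream detector can be trained fully supervised despite the original dataset having no error labels.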



Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods

Carmona, René, Laurière, Mathieu, Tan, Zongjun

arXiv.org Artificial Intelligence

We investigate reinforcement learning in the setting of Markov decision processes for a large number of exchangeable agents interacting in a mean field manner. Applications include, for example, the control of a large number of robots communicating through a central unit dispatching the optimal policy computed by maximizing an aggregate reward. An approximate solution is obtained by learning the optimal policy of a generic agent interacting with the statistical distribution of the states and actions of the other agents. We first provide a full analysis of this discrete-time mean field control problem. We then rigorously prove the convergence of exact and model-free policy gradient methods in a mean-field linear-quadratic setting and establish bounds on the rates of convergence. We also provide graphical evidence of the convergence based on implementations of our algorithms.
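For intuition on the linear-quadratic mean-field structure referenced above, a generic form pairs linear dynamics depending on the population's mean state with a quadratic cost. The coefficients and exact cost shape below are standard illustrations, not taken from this paper:

```latex
x_{t+1} = A x_t + \bar{A}\,\bar{x}_t + B u_t + \varepsilon_{t+1},
\qquad
J(\pi) = \mathbb{E}\left[\sum_{t=0}^{T}
    x_t^\top Q x_t
    + (x_t - \bar{x}_t)^\top \bar{Q}\,(x_t - \bar{x}_t)
    + u_t^\top R u_t \right],
```

where $\bar{x}_t$ denotes the mean state of the agent population. In such settings, policy gradient methods typically parameterize a linear feedback control $u_t = -K x_t - \bar{K}\bar{x}_t$ and follow the gradient of $J$ with respect to $(K, \bar{K})$.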


from a domain e ∈ E

Neural Information Processing Systems

We compare structural causal models (SCMs) for covariate shift and concept shift. The language of causal inference provides further intuition for the structure imposed on Problem 3.1 by Assumptions 4.1 and 4.2. In particular, the SCM for problems in which data is generated according to the mechanism described in Assumptions 4.1 and 4.2 is shown in Figure 7a: Assumption 4.1 imposes that X and e are causes of the observed random variable, and Assumption 4.2 places a condition on the conditional distribution of Y. To offer a point of comparison, Figure 7b shows a different SCM, corresponding to concept shift, that does not fulfill our assumptions.


Preference-aware compensation policies for crowdsourced on-demand services

Nouli, Georgina, Parmentier, Axel, Schiffer, Maximilian

arXiv.org Artificial Intelligence

Crowdsourced on-demand services offer benefits such as reduced costs, faster service fulfillment times, greater adaptability, and contributions to sustainable urban transportation in on-demand delivery contexts. However, the success of an on-demand platform that utilizes crowdsourcing relies on finding a compensation policy that strikes a balance between creating attractive offers for gig workers and ensuring profitability. In this work, we examine a dynamic pricing problem for an on-demand platform that sets request-specific compensation for gig workers in a discrete-time framework, where requests and workers arrive stochastically. The operator's goal is to determine a compensation policy that maximizes the total expected reward over the time horizon. Our approach introduces compensation strategies that explicitly account for gig workers' request preferences. To achieve this, we employ the Multinomial Logit model to represent the acceptance probabilities of gig workers and, as a result, derive an analytical solution that utilizes post-decision states. Subsequently, we integrate this solution into an approximate dynamic programming algorithm. We compare our algorithm against benchmark algorithms, including formula-based policies and an upper bound provided by the full information linear programming solution. Our algorithm demonstrates consistent performance across diverse settings, achieving improvements of at least 2.5-7.5% in homogeneous gig worker populations and 9% in heterogeneous populations over benchmarks, based on fully synthetic data.
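The Multinomial Logit acceptance model mentioned above can be sketched as follows. The utility form (linear in compensation) and the parameters `beta` and `u0` are illustrative assumptions, not the paper's calibrated model:

```python
import math

# Hedged sketch: MNL acceptance probability of a gig worker for an offered
# request, given its compensation. Alternatives (other requests, or rejecting
# outright) enter through their deterministic utilities.

def mnl_accept_prob(compensation, utilities_other, beta=1.0, u0=0.0):
    """P(worker accepts the offered request) under a Multinomial Logit model.
    beta: sensitivity to compensation; u0: base utility of the request;
    utilities_other: utilities of competing options, including rejection."""
    v = beta * compensation + u0
    denom = math.exp(v) + sum(math.exp(u) for u in utilities_other)
    return math.exp(v) / denom
```

The closed-form probability is what makes an analytical treatment of the pricing decision tractable: acceptance is smooth and monotonically increasing in the offered compensation, so the operator can trade off acceptance probability against payout.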


Sequential Change Point Detection via Denoising Score Matching

Zhou, Wenbin, Xie, Liyan, Peng, Zhigang, Zhu, Shixiang

arXiv.org Machine Learning

Sequential change-point detection plays a critical role in numerous real-world applications, where timely identification of distributional shifts can greatly mitigate adverse outcomes. Classical methods commonly rely on parametric density assumptions on the pre- and post-change distributions, limiting their effectiveness for high-dimensional, complex data streams. This paper proposes a score-based CUSUM change-point detection procedure, in which the score functions of the data distribution are estimated by injecting noise and applying denoising score matching. We consider both offline and online versions of score estimation. Through theoretical analysis, we demonstrate that denoising score matching can enhance detection power by effectively controlling the injected noise scale. Finally, we validate the practical efficacy of our method through numerical experiments on two synthetic datasets and a real-world earthquake precursor detection task, demonstrating its effectiveness in challenging scenarios.
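The score-based CUSUM idea can be sketched as follows. This is a toy illustration, not the paper's estimator: the increment uses known 1D Gaussian scores in a Hyvärinen-style statistic, standing in for the denoising-score-matching estimates the paper would plug in:

```python
# Hedged sketch of a CUSUM recursion whose per-sample statistic is built from
# score functions of the pre- and post-change distributions, rather than a
# parametric log-likelihood ratio.

def cusum_detect(stream, incr, threshold):
    """Return the first index at which the CUSUM statistic crosses threshold,
    or None. incr(x) is the per-sample drift statistic (positive in
    expectation after the change, negative before)."""
    s = 0.0
    for t, x in enumerate(stream):
        s = max(0.0, s + incr(x))
        if s > threshold:
            return t
    return None

# Illustration with known 1D Gaussian scores:
# pre-change N(0,1), post-change N(2,1).
score_pre = lambda x: -x            # d/dx log N(0,1) density
score_post = lambda x: -(x - 2.0)   # d/dx log N(2,1) density

def hyvarinen_increment(x):
    # Difference of Hyvarinen scores, 0.5*s_pre^2 - 0.5*s_post^2; the
    # second-derivative terms cancel here since both densities are Gaussian
    # with unit variance.
    return 0.5 * score_pre(x) ** 2 - 0.5 * score_post(x) ** 2
```

Because the increment depends only on score functions, the recursion needs no normalized densities, which is what lets score matching replace parametric likelihood assumptions.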


Real-Time Algorithms for Game-Theoretic Motion Planning and Control in Autonomous Racing using Near-Potential Function

Kalaria, Dvij, Maheshwari, Chinmay, Sastry, Shankar

arXiv.org Artificial Intelligence

Autonomous racing extends beyond the challenge of controlling a racecar at its physical limits. Professional racers employ strategic maneuvers to outwit competing opponents and secure victory. While modern control algorithms can achieve human-level performance by computing offline racing lines for single-car scenarios, research on real-time algorithms for multi-car autonomous racing is limited. To bridge this gap, we develop a game-theoretic modeling framework that incorporates the competitive aspects of autonomous racing, such as overtaking and blocking, through a novel policy parametrization, while operating the car at its limit. Furthermore, we propose an algorithmic approach to compute the (approximate) Nash equilibrium strategy, which represents the optimal approach in the presence of competing agents. Specifically, we introduce an algorithm inspired by the recently introduced framework of dynamic near-potential functions, enabling real-time computation of the Nash equilibrium. Our approach comprises two phases: offline and online. During the offline phase, we use simulated racing data to learn a near-potential function that approximates utility changes for agents. This function facilitates the online computation of approximate Nash equilibria by maximizing its value. We evaluate our method in a head-to-head 3-car racing scenario, demonstrating superior performance compared to several existing baselines.
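The online phase described above, computing an approximate Nash equilibrium by maximizing a learned potential, can be sketched as coordinate ascent over joint actions. The toy `phi` and discrete action sets below are stand-ins for the learned near-potential function and the racing policy parameters:

```python
# Hedged sketch: in a potential game, iterated best responses each weakly
# increase the potential, so coordinate ascent on a (near-)potential function
# terminates at a profile where no agent can improve, i.e. an (approximate)
# Nash equilibrium.

def best_response_via_potential(phi, action_sets, max_rounds=50):
    """Coordinate-ascent on phi over joint action profiles.
    phi: callable on a list of actions (one per agent);
    action_sets: list of finite action sets, one per agent."""
    profile = [actions[0] for actions in action_sets]
    for _ in range(max_rounds):
        changed = False
        for i, actions in enumerate(action_sets):
            best = max(actions,
                       key=lambda a: phi(profile[:i] + [a] + profile[i + 1:]))
            if best != profile[i]:
                profile[i] = best
                changed = True
        if not changed:   # fixed point: no unilateral improvement exists
            break
    return profile
```

When the potential only approximately tracks each agent's utility change (a *near*-potential function), the fixed point is correspondingly an approximate rather than exact equilibrium, which is the trade-off that buys real-time computation.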