AITopics | expression

Collaborating Authors

expression

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Counterfactually Fair Regression via Optimal Transport

Lince, M. Generali, Gaucher, S., Vie, J-J., Loiseau, P.

arXiv.org Machine LearningMay-28-2026

We consider the problem of learning a counterfactually fair regressor. We adopt a causal uncertainty view in which counterfactual fairness is defined with resampled noise. We focus on obtaining theoretical fairness guarantees for a new post-processing estimator. We begin by showing that counterfactual fairness is equivalent to satisfying demographic parity conditional on the latent variable. This allows us to provide a closed-form expression of the optimal fair regressor via a barycentric quantile map. In order to handle continuous latent variables, we propose a discretized post-processing method. Then, under mild regularity assumptions, we prove high-probability finite-sample fairness guarantees for our estimator, providing an unfairness decay at rate $\tilde O(n^{-1/3})$, and establishing a matching risk bound of order $\tilde O(n^{-1/3})$. We provide a matching lower bound on the excess risk of almost fair predictions. Finally, we extend our results to the setting of relaxed counterfactual fairness. We validate our approach on real-world and synthetic data.

artificial intelligence, machine learning, predictor, (19 more...)

arXiv.org Machine Learning

2605.28251

Country: North America > United States > New York (0.28)

Genre: Research Report > New Finding (0.48)

Industry:

Education (0.94)
Law (0.68)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Symbolic Density Estimation for Discrete Distributions

Liu, Ziwen, Li, Meng

arXiv.org Machine LearningMay-25-2026

Discrete probability laws underpin statistical modeling, yet the catalog of interpretable distributions has expanded only gradually through centuries of case-by-case mathematical derivations. We introduce symbolic density estimation (SDE), an unsupervised framework that automatically recovers closed-form probability mass functions by composing elementary analytic operations within a structured search space. Our method integrates domain-specific structural priors with evolutionary search and a validity-aware inference stage, and it extends to richer distribution families such as zero inflation and finite mixtures. To support systematic evaluation and future research, we contribute a benchmark dataset spanning a broad collection of commonly used discrete distributions. The proposed algorithm recovers all benchmark families with accurate parameter estimates. A real data application shows that it identifies concise and interpretable mixture models that improve goodness-of-fit over standard models.

artificial intelligence, evolutionary algorithm, machine learning, (18 more...)

arXiv.org Machine Learning

2605.21813

Country: North America > United States > New York (0.28)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)

Add feedback

$α$-TCAV: A Unified Framework for Testing with Concept Activation Vectors

Schnoor, Ekkehard, Said, Jawher, Tiomoko, Malik, Samek, Wojciech, Jung, Alexander

arXiv.org Machine LearningMay-18-2026

Concept Activation Vectors (CAVs) are a fundamental tool for concept-based explainability in deep learning, yet their practical utility is limited by statistical instability. We analyze the stochastic nature of CAVs and the Testing with CAVs (TCAV) method, deriving the distributions of major CAV classes including PatternCAV, FastCAV, and ridge regression-based CAVs. We then identify a fundamental flaw in the standard TCAV score: its reliance on a discontinuous indicator function induces non-decaying variance in critical regimes. To address this, we introduce $α$-TCAV, a generalized framework that replaces the indicator with a parameterized smooth function, yielding a unified probabilistic formulation that subsumes both TCAV and Multi-TCAV. We characterize the induced distributions of sensitivity scores and different TCAV variants, showing that established state-of-the-art choices lack theoretical justification. We provide principled guidance on tuning the parameter in $α$-TCAV -- either to imitate Multi-TCAV at substantially lower computational cost, or to obtain a calibrated Bayes-optimal probabilistic measure of a concept's influence. Finally, our analysis yields practical recommendations that challenge established routines: most notably, allocating the full sampling budget to a single CAV rather than splitting it across several.

machine learning, natural language, variance, (18 more...)

arXiv.org Machine Learning

2605.15688

Country:

Europe (1.00)
North America > United States (0.45)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Data Science (0.92)
(2 more...)

Add feedback

InfoSFT: Learn More and Forget Less with Information-Aware Token Weighting

Sabbaghi, Mahdi, Pappas, George, Javanmard, Adel, Hassani, Hamed

arXiv.org Machine LearningMay-15-2026

Supervised fine-tuning (SFT) provides the standard approach for teaching LLMs new behaviors from offline expert demonstrations. However, standard SFT uniformly fits all samples -- including those with low likelihood under the base model -- which can disproportionately drive training updates toward overfitting specific samples rather than learning the target behavior. Moreover, adapting to these unlikely samples induces substantial policy shifts that degrade prior capabilities. Existing methods mitigate this by filtering, regenerating, or down-weighting low-likelihood data. In doing so, they often suppress precisely the novel behaviors the base model has yet to learn. We propose InfoSFT, a principled weighting scheme for the SFT objective that concentrates learning signals on maximally informative, medium-confidence tokens -- those neither overly familiar to the base model nor too unlikely to cause instability. Requiring only a one-line modification to the standard token-wise loss, InfoSFT demonstrably improves generalization over vanilla SFT and likelihood-weighted baselines across math, code, and chain-of-thought tasks with diverse model families, while better preserving pre-existing capabilities.

large language model, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

2605.14967

Country: North America > United States > California (0.28)

Genre: Research Report (0.82)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Do dogs smile? Not like us.

A smile can mean a happy or nervous dog. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. This dog might be smiling. Breakthroughs, discoveries, and DIY tips sent six days a week. When you want to use a smile GIF, at least one in 10 are of dogs that grin or appear to smile, with their mouths wide open.

artificial intelligence, expression, physics popular science video space, (10 more...)

Popular Science

Genre: Research Report (0.35)

Industry: Media (0.48)

Technology: Information Technology > Artificial Intelligence (0.51)

Add feedback

Non-asymptotic quantisation of spherically symmetric distributions

Pronzato, Luc, Zhigljavsky, Anatoly

arXiv.org Machine LearningMay-14-2026

Zador's celebrated theorem is a cornerstone of optimal quantisation, establishing both the weak limit of the empirical distribution of an $n$-point optimal quantiser in $R^d$ and the decay rate of the associated $L_s$-mean quantisation error. However, for large dimensions $d$, observing this asymptotic behaviour demands an astronomically large sample size $n$, which grows super-exponentially with $d$. Through a detailed analysis of the quantisation problem for spherically symmetric distributions, we demonstrate that for moderate $n$ random quantisers uniformly distributed on a sphere of suitable radius $r$ achieve exceptional performance. The expected distortion, expressed as a triple integral, can be computed with arbitrary precision, and the optimal radius $r$ can be efficiently determined numerically. Leveraging results from extreme-value theory, we derive approximations for $r$, particularly in scenarios where $n$ scales with $d$. Depending on the growth rate of $n$, $r$ may either converge to zero or approach a limiting value that is independent of $s$.

artificial intelligence, quantiser, sd 1, (16 more...)

arXiv.org Machine Learning

2605.12568

Genre: Research Report (0.65)

Technology: Information Technology > Artificial Intelligence (0.46)

Add feedback

Uniform Scaling Limits in AdamW-Trained Transformers

Gibson, William, Reisinger, Christoph

arXiv.org Machine LearningMay-13-2026

We study the large-depth limit of transformers trained with AdamW, by modelling the hidden-state dynamics as an interacting particle system (IPS) coupled through the attention mechanism. Under appropriate scaling of the attention heads, we prove that the joint dynamics of the hidden states and backpropagated variables converge in $L^2$, uniformly over the initial condition, to the solution of a forward--backward system of ODEs at rate $\mathcal O(L^{-1}+L^{-1/3}H^{-1/2})$. Here, $L$ and $H$ denote the depth and number of heads of the transformer, respectively. The limiting system of ODEs can be identified with a McKean--Vlasov ODE (MVODE) when the attention heads do not incorporate causal masking. By using the flow maps associated with this MVODE and applying concentration of measure techniques, we obtain bounds on the difference between the discrete and continuous models that are uniform over compact sets of initial conditions. As this is achieved without resorting to a covering argument, the constants in our bounds are independent of the number of tokens. Furthermore, under a suitable adaptation to AdamW, the bounds become independent of the token embedding dimension.

large language model, machine learning, natural language, (19 more...)

arXiv.org Machine Learning

2605.11059

Country:

Europe > United Kingdom (0.28)
North America > United States > New York (0.27)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.92)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.67)

Add feedback

Learnability and Competition in High-Dimensional Multi-Component ICA

Genc, Eser Ilke, Demir, Samet, Dogan, Zafer

arXiv.org Machine LearningMay-12-2026

Independent Component Analysis (ICA) is a foundational tool for unsupervised representation learning, yet its high-dimensional theory remains largely limited to single-component recovery. We develop an asymptotically exact mean-field theory for multi-component online ICA, capturing the coupling induced by simultaneous learning and orthogonalization. In the high-dimensional limit, the joint empirical distribution of learned estimates and ground-truth components converges to a deterministic process, yielding a closed ODE system for the overlap matrix between learned directions and true components. This characterization reveals a genuinely multi-component, initialization-driven phase structure: a decoupled regime, where estimates align with distinct components and evolve nearly independently, and a competition regime, where overlapping initializations induce orthogonality-driven conflicts, slow reorientation, and delayed convergence. Our steady-state analysis gives explicit learnability boundaries and competition conditions linking step size, data moments, and initialization. These conditions show that larger higher-order moments and competition shrink the stable learning-rate window, increase convergence times, and predict a staircase phenomenon in which the number of recoverable components changes discretely with the learning rate. Experiments on synthetic data and hyperspectral remote sensing data validate the predicted trajectories and phase behavior.

artificial intelligence, expression, machine learning, (17 more...)

arXiv.org Machine Learning

2605.08552

Genre: Research Report (0.64)

Industry: Energy (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Price of Quality: Sufficient Conditions for Sparse Recovery using Mixed-Quality Data

Chaabouni, Youssef, Gamarnik, David

arXiv.org Machine LearningMay-12-2026

We study sparse recovery when observations come from mixed-quality sources: a small collection of high-quality measurements with small noise variance and a larger collection of lower-quality measurements with higher variance. For this heterogeneous-noise setting, we establish sample-size conditions for information-theoretic and algorithmic recovery. On the information-theoretic side, we show that it is sufficient for $(n_1, n_2)$ to satisfy a linear trade-off defining the Price of Quality: the number of low-quality samples needed to replace one high-quality sample. In the agnostic setting, where the decoder is completely agnostic to the quality of the data, it is uniformly bounded, and in particular one high-quality sample is never worth more than two low-quality samples for this sufficient condition to hold. In the informed setting, where the decoder is informed of per-sample variances, the price of quality can grow arbitrarily large. On the algorithmic side, we analyze the LASSO in the agnostic setting and show that the recovery threshold matches the homogeneous-noise case and only depends on the average noise level, revealing a striking robustness of computational recovery to data heterogeneity. Together, these results give the first conditions for sparse recovery with mixed-quality data and expose a fundamental difference between how the information-theoretic and algorithmic thresholds adapt to changes in data quality.

artificial intelligence, data quality, machine learning, (19 more...)

arXiv.org Machine Learning

2605.10713

Country: North America > United States > Massachusetts (0.28)

Genre: Research Report (1.00)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Quality (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

Add feedback

Contagious yawning begins in the WOMB, experts reveal - as foetuses are seen copying their mothers' mouth movements

Daily Mail - Science & techMay-11-2026, 12:09:35 GMT

There's nothing quite as contagious as a yawn – and it turns out even babies in the womb aren't immune. Experts have discovered foetuses'catch' yawns from their mothers and have been seen slowly opening and closing their mouths. As part of a study, they recorded the facial expressions of pregnant women while an ultrasound machine captured real-time images of their foetuses' faces. By comparing the two records, the researchers found that foetuses were more likely to yawn after their mothers did, with a delay of around 90 seconds. They said yawning may change the mother's breathing, chest pressure and diaphragm movements, which could provide physical cues the foetus detects.

artificial intelligence, foetuse, yawn, (8 more...)

Daily Mail - Science & tech

Genre: Research Report (0.54)

Industry: Health & Medicine > Therapeutic Area (0.60)

Technology: Information Technology > Artificial Intelligence (0.37)

Add feedback