Delightful Policy Gradient

Osband, Ian

arXiv.org Machine Learning

Standard policy gradients weight each sampled action by advantage alone, regardless of how likely that action was under the current policy. This creates two pathologies: within a single decision context (e.g. one image or prompt), a rare negative-advantage action can disproportionately distort the update direction; across many such contexts in a batch, the expected gradient over-allocates budget to contexts the policy already handles well. We introduce the Delightful Policy Gradient (DG), which gates each term with a sigmoid of "delight", the product of advantage and action surprisal (negative log-probability). For K-armed bandits, DG provably improves directional accuracy in a single context and, across multiple contexts, shifts the expected gradient strictly closer to the supervised cross-entropy oracle. This second effect is not variance reduction: it persists even with infinite samples. Empirically, DG outperforms REINFORCE, PPO, and advantage-weighted baselines across MNIST, transformer sequence modeling, and continuous control, with larger gains on harder tasks.
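
Based only on the abstract's description, a minimal sketch of the gated update might look like the following. The function name, the stop-gradient through the gate, and the exact placement of the gate are assumptions, not details taken from the paper.

```python
import torch

def delightful_policy_gradient_loss(log_probs, advantages):
    """Sketch of the gated policy-gradient objective from the abstract.

    log_probs:  log pi(a|s) for each sampled action, shape (batch,)
    advantages: advantage estimates, shape (batch,)

    "Delight" is the product of advantage and action surprisal
    (negative log-probability); each REINFORCE term is gated by a
    sigmoid of this quantity. Blocking gradient flow through the
    gate is an assumption, not confirmed by the source.
    """
    surprisal = -log_probs.detach()       # -log pi(a|s); no grad through the gate
    delight = advantages * surprisal      # advantage x surprisal
    gate = torch.sigmoid(delight)         # per-sample gate in (0, 1)
    # standard REINFORCE term, down-weighted per sample by the gate
    return -(gate * advantages * log_probs).mean()

# Toy usage with random data (hypothetical shapes):
logits = torch.randn(8, 4, requires_grad=True)
log_pi = torch.log_softmax(logits, dim=-1)
actions = torch.randint(0, 4, (8,))
lp = log_pi[torch.arange(8), actions]
adv = torch.randn(8)
print(delightful_policy_gradient_loss(lp, adv))
```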


ea89621bee7c88b2c5be6681c8ef4906-AuthorFeedback.pdf

Neural Information Processing Systems

In contrast, we use 10% of the training set for validation, and treat the validation set as a purely held-out test set (this also means that we train on less data). We will explain this more clearly. [...] both spheres are sufficiently tiny (i.e.
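
As an illustration of the split protocol described in this excerpt (not code from the rebuttal; all variable names and data here are hypothetical), one might carve the validation set out of the training data like this:

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hypothetical data standing in for the actual training set.
X_train = np.random.rand(1000, 32)
y_train = np.random.randint(0, 10, size=1000)

# Carve 10% off the training set for validation, so the original
# validation set can be treated as a purely held-out test set
# (at the cost of training on slightly less data).
X_tr, X_val, y_tr, y_val = train_test_split(
    X_train, y_train, test_size=0.10, random_state=0
)
```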


51cdbd2611e844ece5d80878eb770436-AuthorFeedback.pdf

Neural Information Processing Systems

Optimal Transport (OT) + Fairness (R2, R4): Let us highlight two key differences between "Wasserstein Fair Classification" (Jiang et al.) and our work. Generally, group fairness constraints aim to reflect a certain independence between the prediction and the sensitive attribute.
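
As an illustration of what such an independence constraint can look like (a standard example, demographic parity, not one stated in this excerpt): the prediction $\hat{Y}$ must be statistically independent of the sensitive attribute $A$,

$$\hat{Y} \perp A \quad\Longleftrightarrow\quad P(\hat{Y} = 1 \mid A = a) = P(\hat{Y} = 1) \quad \text{for all } a.$$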