AITopics

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.65)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.61)

Isabel Valera, Adish Singla, Manuel Gomez Rodriguez

Enhancing the Accuracy and Fairness of Human Decision Making

Neural Information Processing SystemsFeb-12-2026, 05:01:47 GMT

However, these decisions may be imperfect due to limited experience, implicit biases, or faulty probabilistic reasoning.

artificial intelligence, machine learning, threshold, (16 more...)

Country:

Europe > Germany (0.05)
North America > Canada > Quebec > Montreal (0.04)

Industry: Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Neural Information Processing SystemsJan-31-2026, 04:10:27 GMT

A Bayesian Nonparametrics View into Deep Representations

We investigate neural network representations from a probabilistic perspective. Specifically, we leverage Bayesian nonparametrics to construct models of neural activations in Convolutional Neural Networks (CNNs) and latent representations in Variational Autoencoders (VAEs). This allows us to formulate a tractable complexity measure for distributions of neural activations and to explore global structure of latent spaces learned by VAEs. We use this machinery to uncover how memorization and two common forms of regularization, i.e. dropout and input augmentation, influence representational complexity in CNNs. We demonstrate that networks that can exploit patterns in data learn vastly less complex representations than networks forced to memorize.

artificial intelligence, bayesian nonparametric view, machine learning, (12 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing SystemsAug-15-2025, 13:51:50 GMT

69c49f75ca31620f1f0d38093d9f3d9b-Supplemental-Conference.pdf

hdi 97, hyper mean sd hdi 3, sampler, (13 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.48)

arXiv.org Machine LearningJul-29-2025

TS-Insight: Visualizing Thompson Sampling for Verification and XAI

Vares, Parsa, Durant, Éloi, Pang, Jun, Médoc, Nicolas, Ghoniem, Mohammad

Thompson Sampling (TS) and its variants are powerful Multi-Armed Bandit algorithms used to balance exploration and exploitation strategies in active learning. Yet, their probabilistic nature often turns them into a ``black box'', hindering debugging and trust. We introduce TS-Insight, a visual analytics tool explicitly designed to shed light on the internal decision mechanisms of Thompson Sampling-based algorithms, for model developers. It comprises multiple plots, tracing for each arm the evolving posteriors, evidence counts, and sampling outcomes, enabling the verification, diagnosis, and explainability of exploration/exploitation dynamics. This tool aims at fostering trust and facilitating effective debugging and deployment in complex binary decision-making scenarios especially in sensitive domains requiring interpretable decision-making.

artificial intelligence, data mining, machine learning, (18 more...)

2507.19898

Genre: Research Report (0.52)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.91)

Neural Information Processing SystemsJan-21-2025, 12:26:27 GMT

Review for NeurIPS paper: Conservative Q-Learning for Offline Reinforcement Learning

The theoretical claims of producing lower bounds on Q values are not sufficient since there is no proof that the conservative Q values are anywhere near the true Q values. Just estimating Q 0 for positive rewards could give the same result at theorems 3.1, 3.2, and 3.3. Clearly, the algorithm is doing something smarter than this, but the current analysis does not characterize what the algorithm is doing. The gap expanding result is likely the strongest of the four theorems, but without doing the work to connect this back to why this will actually help performance it is still difficult to judge. Moreover, no comparison is made to the overestimation that would happen without the proposed algorithmic change.

conservative q-learning, neurips paper, offline reinforcement learning, (8 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Nichani, Eshaan, Lee, Jason D., Bietti, Alberto

Understanding Factual Recall in Transformers via Associative Memories

arXiv.org Machine LearningDec-9-2024

Large language models have demonstrated an impressive ability to perform factual recall. Prior work has found that transformers trained on factual recall tasks can store information at a rate proportional to their parameter count. In our work, we show that shallow transformers can use a combination of associative memories to obtain such near optimal storage capacity. We begin by proving that the storage capacities of both linear and MLP associative memories scale linearly with parameter count. We next introduce a synthetic factual recall task, and prove that a transformer with a single layer of self-attention followed by an MLP can obtain 100% accuracy on the task whenever either the total number of self-attention parameters or MLP parameters scales (up to log factors) linearly with the number of facts. In particular, the transformer can trade off between using the value matrices or the MLP as an associative memory to store the dataset of facts. We complement these expressivity results with an analysis of the gradient flow trajectory of a simplified linear attention model trained on our factual recall task, where we show that the model exhibits sequential learning behavior.

alpha, machine learning, natural language, (16 more...)

2412.06538

Country:

North America > United States (0.14)
Europe > France > Occitanie > Haute-Garonne > Toulouse (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.49)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Panda, Saunak Kumar, Liu, Ruiqi, Xiang, Yisha

Online Statistical Inference for Time-varying Sample-averaged Q-learning

arXiv.org Machine LearningOct-14-2024

Reinforcement learning (RL) has emerged as a key approach for training agents in complex and uncertain environments. Incorporating statistical inference in RL algorithms is essential for understanding and managing uncertainty in model performance. This paper introduces a time-varying batch-averaged Q-learning algorithm, termed sampleaveraged Q-learning, which improves upon traditional single-sample Q-learning by aggregating samples of rewards and next states to better account for data variability and uncertainty. We leverage the functional central limit theorem (FCLT) to establish a novel framework that provides insights into the asymptotic normality of the sample-averaged algorithm under mild conditions. Additionally, we develop a random scaling method for interval estimation, enabling the construction of confidence intervals without requiring extra hyperparameters. Numerical experiments conducted on classic OpenAI Gym environments show that the time-varying sample-averaged Q-learning method consistently outperforms both single-sample and constant-batch Q-learning methods, achieving superior accuracy while maintaining comparable learning speeds.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

2410.10737

Country:

North America > United States > Texas (0.28)
Asia > Middle East (0.14)

Genre: Research Report (0.63)

Industry:

Energy > Oil & Gas > Upstream (0.48)
Banking & Finance > Trading (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Paulin, Daniel, Whalley, Peter A., Chada, Neil K., Leimkuhler, Benedict

Sampling from Bayesian Neural Network Posteriors with Symmetric Minibatch Splitting Langevin Dynamics

arXiv.org Machine LearningOct-14-2024

We propose a scalable kinetic Langevin dynamics algorithm for sampling parameter spaces of big data and AI applications. Our scheme combines a symmetric forward/backward sweep over minibatches with a symmetric discretization of Langevin dynamics. For a particular Langevin splitting method (UBU), we show that the resulting Symmetric Minibatch Splitting-UBU (SMS-UBU) integrator has bias $O(h^2 d^{1/2})$ in dimension $d>0$ with stepsize $h>0$, despite only using one minibatch per iteration, thus providing excellent control of the sampling bias as a function of the stepsize. We apply the algorithm to explore local modes of the posterior distribution of Bayesian neural networks (BNNs) and evaluate the calibration performance of the posterior predictive probabilities for neural networks with convolutional neural network architectures for classification problems on three different datasets (Fashion-MNIST, Celeb-A and chest X-ray). Our results indicate that BNNs sampled with SMS-UBU can offer significantly better calibration performance compared to standard methods of training and stochastic weight averaging.

artificial intelligence, machine learning, momentum, (21 more...)