Predictive Uncertainty Estimation via Prior Networks
Estimating how uncertain an AI system is in its predictions is important to improve the safety of such systems. Uncertainty in predictions can result from uncertainty in model parameters, irreducible \emph{data uncertainty} and uncertainty due to distributional mismatch between the test and training data distributions. Different actions might be taken depending on the source of the uncertainty, so it is important to be able to distinguish between them. Recently, baseline tasks and metrics have been defined and several practical methods to estimate uncertainty have been developed. These methods, however, attempt to model uncertainty due to distributional mismatch either implicitly through \emph{model uncertainty} or as \emph{data uncertainty}. This work proposes a new framework for modeling predictive uncertainty, called Prior Networks (PNs), which explicitly models \emph{distributional uncertainty}. PNs do this by parameterizing a prior distribution over predictive distributions. This work focuses on uncertainty for classification and evaluates PNs on the tasks of identifying out-of-distribution (OOD) samples and detecting misclassification on the MNIST and CIFAR-10 datasets, where they are found to outperform previous methods. Experiments on synthetic, MNIST, and CIFAR-10 data show that, unlike previous non-Bayesian methods, PNs are able to distinguish between data and distributional uncertainty.
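Concretely, a Prior Network for K-way classification outputs the concentration parameters of a Dirichlet over the K-dimensional simplex, and the uncertainty then decomposes into total, data, and distributional components. The sketch below is a minimal illustration of that decomposition under assumed details (logits read as log-concentrations, mutual information as the distributional measure), not the authors' released code.

```python
# Minimal sketch of a Prior Network head (assumed details, not the paper's code).
# The classifier's logits z are read as log-concentrations of a Dirichlet,
# alpha = exp(z), so the network parameterizes a prior over categorical
# predictive distributions rather than a single point prediction.
import numpy as np
from scipy.special import digamma

def dirichlet_uncertainties(logits):
    alpha = np.exp(logits)        # concentration parameters, alpha_k > 0
    alpha0 = alpha.sum()          # precision: small alpha0 = spread-out prior
    p = alpha / alpha0            # expected predictive distribution

    # Total uncertainty: entropy of the expected categorical distribution.
    total = -np.sum(p * np.log(p))

    # Expected data uncertainty: expected entropy under the Dirichlet.
    data = -np.sum(p * (digamma(alpha + 1) - digamma(alpha0 + 1)))

    # Distributional uncertainty: mutual information between the label and
    # the categorical parameters (total minus expected data uncertainty).
    return total, data, total - data

# Confident in-distribution input: one dominant concentration.
print(dirichlet_uncertainties(np.array([4.0, 0.0, 0.0])))
# OOD-looking input: uniformly small concentrations, high distributional term.
print(dirichlet_uncertainties(np.array([0.0, 0.0, 0.0])))
```

The useful property this sketch exhibits is the one the abstract claims: a flat expected prediction can come either from high data uncertainty (large, equal concentrations) or from distributional uncertainty (small, equal concentrations), and the decomposition tells them apart.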
Causal computations in Semi-Markovian Structural Causal Models using divide and conquer
Bjøru, Anna Rodum, Cabañas, Rafael, Langseth, Helge, Salmerón, Antonio
Recently, Bjøru et al. proposed a novel divide-and-conquer algorithm for bounding counterfactual probabilities in structural causal models (SCMs). They assumed that the SCMs were learned from purely observational data, leading to an imprecise characterization of the marginal distributions of exogenous variables. Their method leveraged the canonical representation of structural equations to decompose a general SCM with high-cardinality exogenous variables into a set of sub-models with low-cardinality exogenous variables. These sub-models had precise marginals over the exogenous variables and therefore admitted efficient exact inference. The aggregated results were used to bound counterfactual probabilities in the original model. The approach was developed for Markovian models, where each exogenous variable affects only a single endogenous variable. In this paper, we investigate extending the methodology to \textit{semi-Markovian} SCMs, where exogenous variables may influence multiple endogenous variables. Such models are capable of representing confounding relationships that Markovian models cannot. We illustrate the challenges of this extension using a minimal example, which motivates a set of alternative solution strategies. These strategies are evaluated both theoretically and through a computational study.
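The imprecision the abstract refers to is easiest to see in the smallest possible case: a binary treatment X and outcome Y in a Markovian model, where observational data only bound counterfactual quantities such as the probability of necessity and sufficiency (PNS). The sketch below is an illustrative aside computing the classical Tian-Pearl bounds, not the paper's divide-and-conquer algorithm; the function and variable names are ours.

```python
# Illustrative aside (not the paper's algorithm): in a binary Markovian SCM
# X -> Y, observational data identify P(y|x) and P(y|x') but only *bound*
# the counterfactual PNS = P(Y_x = 1, Y_{x'} = 0).  Under exogeneity these
# are the Tian-Pearl bounds.
def pns_bounds(p_y_given_x, p_y_given_not_x):
    lower = max(0.0, p_y_given_x - p_y_given_not_x)
    upper = min(p_y_given_x, 1.0 - p_y_given_not_x)
    return lower, upper

lo, hi = pns_bounds(0.9, 0.3)   # e.g. P(y|x) = 0.9, P(y|x') = 0.3
print(lo, hi)                   # 0.6 0.7 : an interval, not a point value
```

The interval arises because several mixtures over canonical (deterministic) structural equations are consistent with the same observational distribution; in semi-Markovian models, where an exogenous variable confounds several endogenous ones, the space of consistent mixtures grows and such intervals generally widen.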
Applying Large Language Models to Characterize Public Narratives
Poole-Dayan, Elinor, Kessler, Daniel T, Chiou, Hannah, Hughes, Margaret, Lin, Emily S, Ganz, Marshall, Roy, Deb
Public Narratives (PNs) are key tools for leadership development and civic mobilization, yet their systematic analysis remains challenging due to their subjective interpretation and the high cost of expert annotation. In this work, we propose a novel computational framework that leverages large language models (LLMs) to automate the qualitative annotation of public narratives. Using a codebook we co-developed with subject-matter experts, we evaluate LLM performance against that of expert annotators. Our work reveals that LLMs can reach near-human-expert performance, achieving an average F1 score of 0.80 across 8 narratives and 14 codes. We then extend our analysis to empirically explore how PN framework elements manifest across a larger dataset of 22 stories. Lastly, we extrapolate our analysis to a set of political speeches, establishing a novel lens through which to analyze political rhetoric in civic spaces. This study demonstrates the potential of LLM-assisted annotation for scalable narrative analysis and highlights key limitations and directions for future research in computational civic storytelling.
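The evaluation the abstract describes reduces to comparing binary code assignments between the LLM and the experts, per code, and averaging. A hypothetical sketch of that comparison using scikit-learn follows; the code names and toy data are made up for illustration and are not the authors' dataset.

```python
# Hypothetical sketch of the evaluation described above: compare LLM code
# assignments against expert annotations per codebook code, then macro-average.
# Code names and data are assumptions, not the paper's dataset.
from sklearn.metrics import f1_score

# Binary presence/absence of each code across four narrative segments.
expert = {"challenge": [1, 0, 1, 1], "choice": [0, 1, 1, 0], "outcome": [1, 1, 0, 0]}
llm    = {"challenge": [1, 0, 1, 0], "choice": [0, 1, 1, 0], "outcome": [1, 0, 0, 0]}

per_code = {code: f1_score(expert[code], llm[code]) for code in expert}
print(per_code)
print("macro-average F1:", sum(per_code.values()) / len(per_code))
```

Macro-averaging over codes (rather than pooling all decisions) keeps rare codes from being swamped by frequent ones, which matters when a 14-code codebook is applied to short narratives.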
Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning
Yu, Xiangning, Wang, Zhuohan, Yang, Linyi, Li, Haoxuan, Liu, Anjie, Xue, Xiao, Wang, Jun, Yang, Mengyue
Chain-of-Thought (CoT) prompting plays an indispensable role in endowing large language models (LLMs) with complex reasoning capabilities. However, CoT currently faces two fundamental challenges: (1) Sufficiency, which ensures that the generated intermediate inference steps comprehensively cover and substantiate the final conclusion; and (2) Necessity, which identifies the inference steps that are truly indispensable for the soundness of the resulting answer. We propose a causal framework that characterizes CoT reasoning through the dual lenses of sufficiency and necessity. Incorporating causal Probability of Sufficiency and Necessity allows us not only to determine which steps are logically sufficient or necessary to the prediction outcome, but also to quantify their actual influence on the final reasoning outcome under different intervention scenarios, thereby enabling the automated addition of missing steps and the pruning of redundant ones. Extensive experimental results on various mathematical and commonsense reasoning benchmarks confirm substantial improvements in reasoning efficiency and reduced token usage without sacrificing accuracy. Our work provides a promising direction for improving LLM reasoning performance and cost-effectiveness.
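One way to read the intervention logic the abstract describes: a step's necessity is probed by deleting it, and its sufficiency by keeping the chain only up to that step, in each case measuring how often the model still reaches the correct answer. The sketch below is our hedged reading of that idea, not the authors' code; `answer_given_steps` is a toy stand-in for an LLM call.

```python
# Hedged sketch of per-step intervention scoring (our reading of the framework,
# not the authors' implementation).
def answer_given_steps(question, steps):
    # Toy stand-in for sampling an LLM answer conditioned on `steps`; here the
    # "model" succeeds only if the one genuinely needed step is present.
    return "42" if "compute 6 * 7" in steps else "?"

def step_scores(question, steps, correct, n_samples=20):
    def hit_rate(kept):
        hits = sum(answer_given_steps(question, kept) == correct
                   for _ in range(n_samples))
        return hits / n_samples

    scores = []
    for i in range(len(steps)):
        necessity   = 1.0 - hit_rate(steps[:i] + steps[i + 1:])  # ablate step i
        sufficiency = hit_rate(steps[:i + 1])                    # keep prefix only
        scores.append((steps[i], necessity, sufficiency))
    return scores  # low-necessity steps are pruning candidates

print(step_scores("6 times 7?", ["restate problem", "compute 6 * 7", "digress"], "42"))
```

In this toy run the digression scores zero necessity and is flagged for pruning, while the arithmetic step scores high on both measures, mirroring the add-missing / prune-redundant behaviour the abstract claims.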
Export Reviews, Discussions, Author Feedback and Meta-Reviews
First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. Summary: This paper attempts to link sparse optimization methodology to the anatomical structure of the locust's early olfactory system. The work is motivated by the observation that odorant molecules are sparsely represented by the population of Kenyon cells. The authors first mathematically formulate the olfactory system as a MAP decoder, and give the standard solution to the problem without considering biological constraints. Next, to make the solution more biologically plausible, the authors reformulate the olfactory system model as a decoder of a compressive sensing problem, and provide two standard solutions to the dual problem. Then, the authors argue that each component of the solution can be mapped onto, or interpreted as, a unit of the biological structure of the olfactory system. However, these mappings are described without strong justification, and there are conceptual problems in linking the math with the biology.
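For readers unfamiliar with the decoder the review refers to: the sparse MAP problem is typically posed as an l1-regularized least-squares (LASSO) program, min_x 0.5 * ||y - Ax||^2 + lam * ||x||_1, and compressive-sensing recovery solves it iteratively. The sketch below is a textbook ISTA solver for that program, not the paper's biologically mapped circuit; the matrix shapes and parameters are illustrative.

```python
# Generic sketch of the sparse (compressive-sensing) decoder the review
# discusses: ISTA for  min_x 0.5*||y - A x||^2 + lam*||x||_1.
# A textbook solver, not the paper's biologically mapped circuit.
import numpy as np

def ista(A, y, lam=0.01, n_iters=500):
    L = np.linalg.norm(A, 2) ** 2          # Lipschitz constant of the gradient
    x = np.zeros(A.shape[1])
    for _ in range(n_iters):
        grad = A.T @ (A @ x - y)           # gradient of the smooth term
        z = x - grad / L                   # gradient step
        x = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)  # soft-threshold
    return x

rng = np.random.default_rng(0)
A = rng.normal(size=(30, 100)) / np.sqrt(30)            # wide sensing matrix
x_true = np.zeros(100); x_true[[5, 40, 77]] = [1.0, -2.0, 1.5]  # sparse signal
y = A @ x_true
x_hat = ista(A, y)
print(np.flatnonzero(np.abs(x_hat) > 0.5))  # typically recovers support {5, 40, 77}
```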