AITopics | Decision Support Systems

Plotting

Decision Support Systems

News Overviews Instructional Materials AI-Alerts Classics

Amortized Bayesian Experimental Design for Decision-Making Yujia Guo Aalto University

Neural Information Processing SystemsMar-27-2025, 07:18:46 GMT

Many critical decisions, such as personalized medical diagnoses and product pricing, are made based on insights gained from designing, observing, and analyzing a series of experiments. This highlights the crucial role of experimental design, which goes beyond merely collecting information on system parameters as in traditional Bayesian experimental design (BED), but also plays a key part in facilitating downstream decision-making. Most recent BED methods use an amortized policy network to rapidly design experiments. However, the information gathered through these methods is suboptimal for down-the-line decision-making, as the experiments are not inherently designed with downstream objectives in mind. In this paper, we present an amortized decision-aware BED framework that prioritizes maximizing downstream decision utility. We introduce a novel architecture, the Transformer Neural Decision Process (TNDP), capable of instantly proposing the next experimental design, whilst inferring the downstream decision, thus effectively amortizing both tasks within a unified workflow. We demonstrate the performance of our method across several tasks, showing that it can deliver informative designs and facilitate accurate decision-making.

decision support system, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

Europe > Finland (0.14)
North America > United States (0.14)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Diagnostic Medicine (0.48)
Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Decision Support Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.68)
(4 more...)

Add feedback

Policy-Conditioned Uncertainty Sets for Robust Markov Decision Processes Xiangli Chen Politecnico di Milano

Neural Information Processing SystemsMar-26-2025, 09:55:30 GMT

What policy should be employed in a Markov decision process with uncertain parameters? Robust optimization's answer to this question is to use rectangular uncertainty sets, which independently reflect available knowledge about each state, and then to obtain a decision policy that maximizes the expected reward for the worst-case decision process parameters from these uncertainty sets. While this rectangularity is convenient computationally and leads to tractable solutions, it often produces policies that are too conservative in practice, and does not facilitate knowledge transfer between portions of the state space or across related decision processes. In this work, we propose non-rectangular uncertainty sets that bound marginal moments of state-action features defined over entire trajectories through a decision process. This enables generalization to different portions of the state space while retaining appropriate uncertainty of the decision process. We develop algorithms for solving the resulting robust decision problems, which reduce to finding an optimal policy for a mixture of decision processes, and demonstrate the benefits of our approach experimentally.

constraint, decision support system, machine learning, (20 more...)

Neural Information Processing Systems

Country: North America > United States (0.93)

Technology:

Information Technology > Decision Support Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.71)
(2 more...)

Add feedback

A decision-theoretic approach to dealing with uncertainty in quantum mechanics

De Vos, Keano, de Cooman, Gert, Erreygers, Alexander, De Bock, Jasper

arXiv.org Artificial IntelligenceMar-26-2025

We provide a decision-theoretic framework for dealing with uncertainty in quantum mechanics. This uncertainty is two-fold: on the one hand there may be uncertainty about the state the quantum system is in, and on the other hand, as is essential to quantum mechanical uncertainty, even if the quantum state is known, measurements may still produce an uncertain outcome. In our framework, measurements therefore play the role of acts with an uncertain outcome and our simple decision-theoretic postulates ensure that Born's rule is encapsulated in the utility functions associated with such acts. This approach allows us to uncouple (precise) probability theory from quantum mechanics, in the sense that it leaves room for a more general, so-called imprecise probabilities approach. We discuss the mathematical implications of our findings, which allow us to give a decision-theoretic foundation to recent seminal work by Benavoli, Facchini and Zaffalon, and we compare our approach to earlier and different approaches by Deutsch and Wallace.

artificial intelligence, decision support system, operator, (19 more...)

arXiv.org Artificial Intelligence

2503.20607

Country: Europe (0.67)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Game Theory (0.66)
Information Technology > Decision Support Systems (0.66)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)

Add feedback

An Analytical Study of Utility Functions in Multi-Objective Reinforcement Learning

Neural Information Processing SystemsMar-23-2025, 19:07:41 GMT

Multi-objective reinforcement learning (MORL) is an excellent framework for multi-objective sequential decision-making. MORL employs a utility function to aggregate multiple objectives into one that expresses a user's preferences. However, MORL still misses two crucial theoretical analyses of the properties of utility functions: (1) a characterisation of the utility functions for which an associated optimal policy exists, and (2) a characterisation of the types of preferences that can be expressed as utility functions. In this paper, we contribute to both theoretical analyses. As a result, we formally characterise the families of preferences and utility functions that MORL should focus on: those for which an optimal policy is guaranteed to exist. We expect our theoretical results to foster the development of novel MORL algorithms that exploit our theoretical findings.

decision support system, machine learning, reinforcement learning, (20 more...)

Neural Information Processing Systems

Country:

Europe > Spain (0.14)
North America > United States (0.14)

Genre: Research Report > Experimental Study (1.00)

Industry: Information Technology (0.46)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Decision Support Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

5b4a2146246bc3a3a941f32225bbb792-AuthorFeedback.pdf

Neural Information Processing SystemsMar-23-2025, 16:44:34 GMT

We thank the reviewers for the detailed feedback. Why can we assume smoothness / differentiability of the expected utility? The paper should address if there are ways to use unbounded losses (e.g., by switching from utilities directly to inference The assumptions on utilities (and hence on losses) arise from the derivation of the optimization objective (Eq.1) In Section 3.2, we relax these assumptions and provide In practice, the procedure seems to work well for unbounded losses as well. I believe this could be possible by designing a kind of compound loss... Why exactly the utility infimum should be 0? Optimal decisions {h} in Bayesian decision theory remain invariant under linear transformations of u(.).

artificial intelligence, decision support system, machine learning, (11 more...)

Neural Information Processing Systems

Technology:

Information Technology > Game Theory (0.58)
Information Technology > Decision Support Systems (0.58)
Information Technology > Artificial Intelligence > Machine Learning (0.54)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.37)

Add feedback

Generalized Decision Focused Learning under Imprecise Uncertainty--Theoretical Study

Shariatmadar, Keivan, Yorke-Smith, Neil, Osman, Ahmad, Cuzzolin, Fabio, Hallez, Hans, Moens, David

arXiv.org Artificial IntelligenceMar-20-2025

Decision Focused Learning has emerged as a critical paradigm for integrating machine learning with downstream optimisation. Despite its promise, existing methodologies predominantly rely on probabilistic models and focus narrowly on task objectives, overlooking the nuanced challenges posed by epistemic uncertainty, non-probabilistic modelling approaches, and the integration of uncertainty into optimisation constraints. This paper bridges these gaps by introducing innovative frameworks: (i) a non-probabilistic lens for epistemic uncertainty representation, leveraging intervals (the least informative uncertainty model), Contamination (hybrid model), and probability boxes (the most informative uncertainty model); (ii) methodologies to incorporate uncertainty into constraints, expanding Decision-Focused Learning's utility in constrained environments; (iii) the adoption of Imprecise Decision Theory for ambiguity-rich decision-making contexts; and (iv) strategies for addressing sparse data challenges. Empirical evaluations on benchmark optimisation problems demonstrate the efficacy of these approaches in improving decision quality and robustness and dealing with said gaps.

artificial intelligence, decision support system, generalized decision focused learning, (2 more...)

arXiv.org Artificial Intelligence

2502.17984

Genre: Research Report (0.69)

Technology:

Information Technology > Decision Support Systems (0.53)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.53)

Add feedback

Measuring Goal-Directedness

Neural Information Processing SystemsMar-18-2025, 13:34:08 GMT

We define maximum entropy goal-directedness (MEG), a formal measure of goaldirectedness in causal models and Markov decision processes, and give algorithms for computing it. Measuring goal-directedness is important, as it is a critical element of many concerns about harm from AI. It is also of philosophical interest, as goal-directedness is a key aspect of agency. MEG is based on an adaptation of the maximum causal entropy framework used in inverse reinforcement learning. It can measure goal-directedness with respect to a known utility function, a hypothesis class of utility functions, or a set of random variables.

maxent, maximum entropy policy, utility function, (14 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England (0.14)

Technology:

Information Technology > Decision Support Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.69)
(2 more...)

Add feedback

Conformal Prediction and Human Decision Making

Hullman, Jessica, Wu, Yifan, Xie, Dawei, Guo, Ziyang, Gelman, Andrew

arXiv.org Machine LearningMar-18-2025

Methods to quantify uncertainty in predictions from arbitrary models are in demand in high-stakes domains like medicine and finance. Conformal prediction has emerged as a popular method for producing a set of predictions with specified average coverage, in place of a single prediction and confidence value. However, the value of conformal prediction sets to assist human decisions remains elusive due to the murky relationship between coverage guarantees and decision makers' goals and strategies. How should we think about conformal prediction sets as a form of decision support? We outline a decision theoretic framework for evaluating predictive uncertainty as informative signals, then contrast what can be said within this framework about idealized use of calibrated probabilities versus conformal prediction sets. Informed by prior empirical results and theories of human decisions under uncertainty, we formalize a set of possible strategies by which a decision maker might use a prediction set. We identify ways in which conformal prediction sets and posthoc predictive uncertainty quantification more broadly are in tension with common goals and needs in human-AI decision making. We give recommendations for future research in predictive uncertainty quantification to support human decision makers.

decision maker, information, prediction, (14 more...)

arXiv.org Machine Learning

2503.11709

Country: North America > United States (0.14)

Industry: Health & Medicine > Therapeutic Area (0.94)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Decision Support Systems (1.00)
Information Technology > Data Science (1.00)
(4 more...)

Add feedback

Controlling Counterfactual Harm in Decision Support Systems Based on Prediction Sets

Neural Information Processing SystemsMar-17-2025, 23:18:34 GMT

Decision support systems based on prediction sets help humans solve multiclass classification tasks by narrowing down the set of potential label values to a subset of them, namely a prediction set, and asking them to always predict label values from the prediction sets. While this type of systems have been proven to be effective at improving the average accuracy of the predictions made by humans, by restricting human agency, they may cause harm---a human who has succeeded at predicting the ground-truth label of an instance on their own may have failed had they used these systems. In this paper, our goal is to control how frequently a decision support system based on prediction sets may cause harm, by design. To this end, we start by characterizing the above notion of harm using the theoretical framework of structural causal models. Then, we show that, under a natural, albeit unverifiable, monotonicity assumption, we can estimate how frequently a system may cause harm using only predictions made by humans on their own.

decision support system, machine learning, prediction, (6 more...)

Neural Information Processing Systems

Technology:

Information Technology > Decision Support Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.77)

Add feedback

Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes

Neural Information Processing SystemsMar-17-2025, 19:35:59 GMT

We study the evaluation of a policy under best- and worst-case perturbations to a Markov decision process (MDP), using transition observations from the original MDP, whether they are generated under the same or a different policy. This is an important problem when there is the possibility of a shift between historical and future environments, \emph{e.g.} due to unmeasured confounding, distributional shift, or an adversarial environment. We propose a perturbation model that allows changes in the transition kernel densities up to a given multiplicative factor or its reciprocal, extending the classic marginal sensitivity model (MSM) for single time-step decision-making to infinite-horizon RL. We characterize the sharp bounds on policy value under this model -- \emph{i.e.}, the tightest possible bounds based on transition observations from the original MDP -- and we study the estimation of these bounds from such transition observations. We develop an estimator with several important guarantees: it is semiparametrically efficient, and remains so even when certain necessary nuisance functions, such as worst-case Q-functions, are estimated at slow, nonparametric rates.

artificial intelligence, efficient and sharp off-policy evaluation, machine learning, (8 more...)

Neural Information Processing Systems

Technology:

Information Technology > Decision Support Systems (0.64)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.64)

Add feedback