Goto

Collaborating Authors

 approximate reasoning


Towards Bayesian Data Selection

arXiv.org Machine Learning

A wide range of machine learning algorithms iteratively add data to the training sample. Examples include semi-supervised learning, active learning, multi-armed bandits, and Bayesian optimization. We embed this kind of data addition into decision theory by framing data selection as a decision problem. This paves the way for finding Bayes-optimal selections of data. For the illustrative case of self-training in semi-supervised learning, we derive the respective Bayes criterion. We further show that deploying this criterion mitigates the issue of confirmation bias by empirically assessing our method for generalized linear models, semi-parametric generalized additive models, and Bayesian neural networks on simulated and real-world data.


Interval-valued fuzzy soft $\beta$-covering approximation spaces

arXiv.org Artificial Intelligence

Subsequently, soft sets and rough sets frequently inspires the exploration Gorzalczany [16] introduced the notion of of theories related to soft covering-based rough interval-valued fuzzy sets, where the membership degree sets [2, 3, 11, 13, 43], attaining substantial relevance of set elements lies within the interval [0,1]. in specific domains. However, in fuzzy environments, Interval-valued fuzzy sets are adept at handling scenarios rough set theory demonstrates inherent limitations, as where precise probabilities of set membership discussed in [42]. To overcome these challenges, Zhang are elusive, offering instead an interval within which and Zhan[42] integrated fuzzy sets, soft sets, and rough such probabilities are constrained [16, 29].


CREPO: An Open Repository to Benchmark Credal Network Algorithms

arXiv.org Artificial Intelligence

Credal networks are a popular class of imprecise probabilistic graphical models obtained as a Bayesian network generalization based on, so-called credal, sets of probability mass functions. A Java library called CREMA has been recently released to model, process and query credal networks. Despite the NP-hardness of the (exact) task, a number of algorithms is available to approximate credal network inferences. In this paper we present CREPO, an open repository of synthetic credal networks, provided together with the exact results of inference tasks on these models. A Python tool is also delivered to load these data and interact with CREMA, thus making extremely easy to evaluate and compare existing and novel inference algorithms. To demonstrate such benchmarking scheme, we propose an approximate heuristic to be used inside variable elimination schemes to keep a bound on the maximum number of vertices generated during the combination step. A CREPO-based validation against approximate procedures based on linearization and exact techniques performed in CREMA is finally discussed.


A geometric approach to conditioning belief functions

arXiv.org Artificial Intelligence

Conditioning is crucial in applied science when inference involving time series is involved. Belief calculus is an effective way of handling such inference in the presence of epistemic uncertainty -- unfortunately, different approaches to conditioning in the belief function framework have been proposed in the past, leaving the matter somewhat unsettled. Inspired by the geometric approach to uncertainty, in this paper we propose an approach to the conditioning of belief functions based on geometrically projecting them onto the simplex associated with the conditioning event in the space of all belief functions. We show here that such a geometric approach to conditioning often produces simple results with straightforward interpretations in terms of degrees of belief. This raises the question of whether classical approaches, such as for instance Dempster's conditioning, can also be reduced to some form of distance minimisation in a suitable space. The study of families of combination rules generated by (geometric) conditioning rules appears to be the natural prosecution of the presented research.


Belief functions induced by random fuzzy sets: Application to statistical inference

arXiv.org Artificial Intelligence

It is based on the representation of elementary pieces of evidence by belief functions (defined as completely monotone set functions) and on their combination by an operator called the product-intersection rule, or Dempster's rule of combination. A belief function can be constructed by comparing a piece evidence to a scale of canonical examples such as randomly coded messages, whose meanings are determined by chance [40]. A belief function on a set Θ can be seen as being induced by a multi-valued mapping from a probability space to Ω; it is mathematically equivalent to a random set [5, 34]. As rational beliefs are essentially determined by evidence, the Dempster-Shafer (DS) theory can be regarded as a general framework for reasoning with uncertainty [11]. Shortly after the introduction of DS theory, Zadeh independently proposed another formalism, called Possibility Theory [54], in which the concept of "fuzzy restriction" plays a


Coherent and Archimedean choice in general Banach spaces

arXiv.org Artificial Intelligence

I introduce and study a new notion of Archimedeanity for binary and non-binary choice between options that live in an abstract Banach space, through a very general class of choice models, called sets of desirable option sets. In order to be able to bring horse lottery options into the fold, I pay special attention to the case where these linear spaces do not include all `constant' options. I consider the frameworks of conservative inference associated with Archimedean (and coherent) choice models, and also pay quite a lot of attention to representation of general (non-binary) choice models in terms of the simpler, binary ones. The representation theorems proved here provide an axiomatic characterisation of, amongst other choice methods, Levi's E-admissibility and Walley--Sen maximality.


A belief combination rule for a large number of sources

arXiv.org Artificial Intelligence

The theory of belief functions is widely used for data from multiple sources. Different evidence combination rules have been proposed in this framework according to the properties of the sources to combine. However, most of these combination rules are not efficient when there are a large number of sources. This is due to either the complexity or the existence of an absorbing element such as the total conflict mass function for the conjunctive based rules when applied on unreliable evidence. In this paper, based on the assumption that the majority of sources are reliable, a combination rule for a large number of sources is proposed using a simple idea: the more common ideas the sources share, the more reliable these sources are supposed to be. This rule is adaptable for aggregating a large number of sources which may not all be reliable. It will keep the spirit of the conjunctive rule to reinforce the belief on the focal elements with which the sources are in agreement. The mass on the emptyset will be kept as an indicator of the conflict. The proposed rule, called LNS-CR (Conjunctive combinationRule for a Large Number of Sources), is evaluated on synthetic mass functions. The experimental results verify that the rule can be effectively used to combine a large number of mass functions and to elicit the major opinion.


A Review of Inference Algorithms for Hybrid Bayesian Networks

Journal of Artificial Intelligence Research

Hybrid Bayesian networks have received an increasing attention during the last years. The difference with respect to standard Bayesian networks is that they can host discrete and continuous variables simultaneously, which extends the applicability of the Bayesian network framework in general. However, this extra feature also comes at a cost: inference in these types of models is computationally more challenging and the underlying models and updating procedures may not even support closed-form solutions. In this paper we provide an overview of the main trends and principled approaches for performing inference in hybrid Bayesian networks. The methods covered in the paper are organized and discussed according to their methodological basis. We consider how the methods have been extended and adapted to also include (hybrid) dynamic Bayesian networks, and we end with an overview of established software systems supporting inference in these types of models.


Decision-Making with Belief Functions: a Review

arXiv.org Artificial Intelligence

Approaches to decision-making under uncertainty in the belief function framework are reviewed. Most methods are shown to blend criteria for decision under ignorance with the maximum expected utility principle of Bayesian decision theory. A distinction is made between methods that construct a complete preference relation among acts, and those that allow incomparability of some acts due to lack of information. Methods developed in the imprecise probability framework are applicable in the Dempster-Shafer context and are also reviewed. Shafer's constructive decision theory, which substitutes the notion of goal for that of utility, is described and contrasted with other approaches. The paper ends by pointing out the need to carry out deeper investigation of fundamental issues related to decision-making with belief functions and to assess the descriptive, normative and prescriptive values of the different approaches.


Logistic Regression, Neural Networks and Dempster-Shafer Theory: a New Perspective

arXiv.org Machine Learning

We revisit logistic regression and its nonlinear extensions, including multilayer feedforward neural networks, by showing that these classifiers can be viewed as converting input or higher-level features into Dempster-Shafer mass functions and aggregating them by Dempster's rule of combination. The probabilistic outputs of these classifiers are the normalized plausibilities corresponding to the underlying combined mass function. This mass function is more informative than the output probability distribution. In particular, it makes it possible to distinguish between lack of evidence (when none of the features provides discriminant information) from conflicting evidence (when different features support different classes). This expressivity of mass functions allows us to gain insight into the role played by each input feature in logistic regression, and to interpret hidden unit outputs in multilayer neural networks. It also makes it possible to use alternative decision rules, such as interval dominance, which select a set of classes when the available evidence does not unambiguously point to a single class, thus trading reduced error rate for higher imprecision.