Genre
A Heuristic Bayesian Approach to Knowledge Acquisition: Application to Analysis of Tissue-Type Plasminogen Activator
Shachter, Ross D., Eddy, David M., Hasselblad, Vic, Wolpert, Robert
This paper describes a heuristic Bayesian method for computing probability distributions from experimental data, based upon the normal distribution form of the influence diagram. An example illustrates its use in medical technology assessment. This approach facilitates the integration of results from different studies, and permits a medical expert to make proper assessments without considerable statistical training. There has been extensive research on the construction and manipulation of expert systems using probabilities as a measure for uncertainty. These systems are capable of recognizing considerable dependence and of learning from unreliable observations.
The Recovery of Causal Poly-Trees from Statistical Data
Poly-trees are singly connected causal networks in which variables may arise from multiple causes. This paper develops a method of recovering ply-trees from empirically measured probability distributions of pairs of variables. The method guarantees that, if the measured distributions are generated by a causal process structured as a ply-tree then the topological structure of such tree can be recovered precisely and, in addition, the causal directionality of the branches can be determined up to the maximum extent possible. The method also pinpoints the minimum (if any) external semantics required to determine the causal relationships among the variables considered.
Automated Generation of Connectionist Expert Systems for Problems Involving Noise and Redundancy
When creating an expert system, the most difficult and expensive task is constructing a knowledge base. This is particularly true if the problem involves noisy data and redundant measurements. This paper shows how to modify the MACIE process for generating connectionist expert systems from training examples so that it can accommodate noisy and redundant data. The basic idea is to dynamically generate appropriate training examples by constructing both a 'deep' model and a noise model for the underlying problem. The use of winner-take-all groups of variables is also discussed. These techniques are illustrated with a small example that would be very difficult for standard expert system approaches.
The Inductive Logic of Information Systems
An inductive logic can be formulated in which the elements are not propositions or probability distributions, but information systems. The logic is complete for information systems with binary hypotheses, i.e., it applies to all such systems. It is not complete for information systems with more than two hypotheses, but applies to a subset of such systems. The logic is inductive in that conclusions are more informative than premises. Inferences using the formalism have a strong justification in terms of the expected value of the derived information system.
The Automatic Training of Rule Bases that Use Numerical Uncertainty Representations
The use of numerical uncertainty representations allows better modeling of some aspects of human evidential reasoning. It also makes knowledge acquisition and system development, test, and modification more difficult. We propose that where possible, the assignment and/or refinement of rule weights should be performed automatically. We present one approach to performing this training - numerical optimization - and report on the results of some preliminary tests in training rule bases. We also show that truth maintenance can be used to make training more efficient and ask some epistemological questions raised by training rule weights.
Decision Tree Induction Systems: A Bayesian Analysis
Decision tree induction systems are being used for knowledge acquisition in noisy domains. This paper develops a subjective Bayesian interpretation of the task tackled by these systems and the heuristic methods they use. It is argued that decision tree systems implicitly incorporate a prior belief that the simpler (in terms of decision tree complexity) of two hypotheses be preferred, all else being equal, and that they perform a greedy search of the space of decision rules to find one in which there is strong posterior belief. A number of improvements to these systems are then suggested.
Implementing Evidential Reasoning in Expert Systems
However, the theory has not been implemented for reasoning in expert systems due to.its difficulty dealing with uncertain rules. More recently, several extenstions to the theory has been proposed to overcome this difficulty [Yen, 1986a] [Liu, 1986]. Based on Yen's extended DS theory, we have implemented a prototype expert system, named GERTIS (General Evidential Reasoning Tool for Intelligent Systems), that diagnoses rheumatoid arthritis. We chose unspecified polyarthritis as the area of our medical consultation system because the diagnoses form a disease hierarchy, which fits Dempster-Shafer based reasoning best. GERTIS uses the knowledge base of OADIAG-2, a medical expert system developed by Peter Adlassnig [Adlassnig, 1985a,b]. Through the use of OADIAG-2's knowledge base, relevant evidence and rules have been already identified for the area of arthritis. In order to suit the needs of our model, however, the rules of OADIAG-2 were modified and reorganized.
Structuring Causal Tree Models with Continuous Variables
This paper considers the problem of invoking auxiliary, unobservable variables to facilitate the structuring of causal tree models for a given set of continuous variables. Paralleling the treatment of bi-valued variables in [Pearl 1986], we show that if a collection of coupled variables are governed by a joint normal distribution and a tree-structured representation exists, then both the topology and all internal relationships of the tree can be uncovered by observing pairwise dependencies among the observed variables (i.e., the leaves of the tree). Furthermore, the conditions for normally distributed variables are less restrictive than those governing bi-valued variables. The result extends the applications of causal tree models which were found useful in evidential reasoning tasks.
Satisfaction of Assumptions is a Weak Predictor of Performance
This paper demonstrates a methodology for examining the accuracy of uncertain inference systems (UIS), after their parameters have been optimized, and does so for several common UIS's. This methodology may be used to test the accuracy when either the prior assumptions or updating formulae are not exactly satisfied. Surprisingly, these UIS's were revealed to be no more accurate on the average than a simple linear regression. Moreover, even on prior distributions which were deliberately biased so as give very good accuracy, they were less accurate than the simple probabilistic model which assumes marginal independence between inputs. This demonstrates that the importance of updating formulae can outweigh that of prior assumptions. Thus, when UIS's are judged by their final accuracy after optimization, we get completely different results than when they are judged by whether or not their prior assumptions are perfectly satisfied.