Uncertainty
A Mean Field Algorithm for Bayes Learning in Large Feed-forward Neural Networks
In the Bayes approach to statistical inference [Berger, 1985] one assumes that the prior uncertainty about parameters of an unknown data generating mechanism can be encoded in a probability distribution, the so called prior. Using the prior and the likelihood of the data given the parameters, the posterior distribution of the parameters can be derived from Bayes rule. From this posterior, various estimates for functions ofthe parameter, like predictions about unseen data, can be calculated. However, in general, those predictions cannot be realised by specific parameter values, but only by an ensemble average over parameters according to the posterior probability. Hence,exact implementations of Bayes method for neural networks require averages over network parameters which in general can be performed by time consuming 226 M.Opper and O. Winther Monte Carlo procedures.
Learning Exact Patterns of Quasi-synchronization among Spiking Neurons from Data on Multi-unit Recordings
Martignon, Laura, Laskey, Kathryn B., Deco, Gustavo, Vaadia, Eilon
This paper develops arguments for a family of temporal log-linear models to represent spatiotemporal correlations among the spiking events in a group of neurons. The models can represent not just pairwise correlations but also correlations of higher order. Methods are discussed for inferring the existence or absence of correlations and estimating their strength. A frequentist and a Bayesian approach to correlation detection are compared.
Machine-Learning Research
Machine-learning research has been making great progress in many directions. This article summarizes four of these directions and discusses some current open problems. The four directions are (1) the improvement of classification accuracy by learning ensembles of classifiers, (2) methods for scaling up supervised learning algorithms, (3) reinforcement learning, and (4) the learning of complex stochastic models.
The Fifth International Conference on User Modeling
The Fifth International Conference on User Modeling (UM-96) is part of a recently established, biennial conference series that provides a forum for researchers in the field of user modeling and user-adapted interaction. The next major software revolution after graphic user interfaces will be software that adapts itself to the user. By adapting to the user's needs, preferences, knowledge, language, and even moods, software will attain new levels of usability and broad acceptance that would not be possible without built-in models of the user. This conference series provides a forum for recent research in the field, ranging from theoretical foundations to implemented systems to controlled studies of the human-computer interfaces of user-adapted systems.
Does Machine Learning Really Work?
Does machine learning really work? Yes. Over the past decade, machine learning has evolved from a field of laboratory demonstrations to a field of significant commercial value. Machine-learning algorithms have now learned to detect credit card fraud by mining data on past transactions, learned to steer vehicles driving autonomously on public highways at 70 miles an hour, and learned the reading interests of many individuals to assemble personally customized electronic newsAbstracts. A new computational theory of learning is beginning to shed light on fundamental issues, such as the trade-off among the number of training examples available, the number of hypotheses considered, and the likely accuracy of the learned hypothesis. Newer research is beginning to explore issues such as long-term learning of new representations, the integration of Bayesian inference and induction, and life-long cumulative learning. This article, based on the keynote talk presented at the Thirteenth National Conference on Artificial Intelligence, samples a number of recent accomplishments in machine learning and looks at where the field might be headed. [Copyright restrictions preclude electronic publication of this article.]
Fuzzy set theory applied to bend sequencing for sheet metal bending
Brake forming is widely applied in the high variety and small batch part manufacturing of sheet metal components, for the bending of straight bending lines. Currently, the planning of the bending sequences is a task that has to be performed manually, involving many heuristic criteria. However, set-up and bend sequencing procedures and knowledge have to be formally formalized and modeled, for the development of computer-aided process planning systems for sheet metal forming. This paper describes the application of fuzzy set theory for the normalization and modeling of the set-up and bend sequencing process for sheet metal bending. A fuzzy-set based methodology is used to determine the optimal bending sequences for the brake forming of sheet metal components, taking into account the relative importance of handling and accuracy.
EM Optimization of Latent-Variable Density Models
Bishop, Christopher M., Svensén, Markus, Williams, Christopher K. I.
There is currently considerable interest in developing general nonlinear densitymodels based on latent, or hidden, variables. Such models have the ability to discover the presence of a relatively small number of underlying'causes' which, acting in combination, give rise to the apparent complexity of the observed data set. Unfortunately, totrain such models generally requires large computational effort. In this paper we introduce a novel latent variable algorithm which retains the general nonlinear capabilities of previous models but which uses a training procedure based on the EM algorithm. We demonstrate the performance of the model on a toy problem and on data from flow diagnostics for a multiphase oil pipeline.
EM Optimization of Latent-Variable Density Models
Bishop, Christopher M., Svensén, Markus, Williams, Christopher K. I.
There is currently considerable interest in developing general nonlinear density models based on latent, or hidden, variables. Such models have the ability to discover the presence of a relatively small number of underlying'causes' which, acting in combination, give rise to the apparent complexity of the observed data set. Unfortunately, to train such models generally requires large computational effort. In this paper we introduce a novel latent variable algorithm which retains the general nonlinear capabilities of previous models but which uses a training procedure based on the EM algorithm. We demonstrate the performance of the model on a toy problem and on data from flow diagnostics for a multiphase oil pipeline.