AITopics

1312.6182

Country: North America > United States > California (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.63)

Schmidt, Mikkel N., Mørup, Morten

Non-parametric Bayesian modeling of complex networks

arXiv.org Machine Learning, Dec-20-2013

Modeling structure in complex networks using Bayesian non-parametrics makes it possible to specify flexible model structures and infer the adequate model complexity from the observed data. This paper provides a gentle introduction to non-parametric Bayesian modeling of complex networks: Using an infinite mixture model as running example we go through the steps of deriving the model as an infinite limit of a finite parametric model, inferring the model parameters by Markov chain Monte Carlo, and checking the model's fit and predictive performance. We explain how advanced non-parametric models for complex networks can be derived and point out relevant literature.

artificial intelligence, machine learning, node

doi: 10.1109/MSP.2012.2235191
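The abstract above describes an infinite mixture model obtained as the limit of a finite parametric model. As a rough illustration only, the sketch below draws a node partition from a Chinese restaurant process, which is the prior over partitions that commonly arises in such a limit. Whether the paper uses exactly this construction is an assumption; the function name `sample_crp_partition` and the parameter `alpha` are hypothetical and not taken from the paper.

```python
import random


def sample_crp_partition(n_nodes, alpha, seed=0):
    """Draw cluster labels for n_nodes items from a Chinese restaurant process.

    Illustrative assumption: alpha > 0 is the concentration parameter.
    """
    rng = random.Random(seed)
    labels = []
    counts = []  # counts[k] = number of nodes currently assigned to cluster k
    for _ in range(n_nodes):
        # An existing cluster k is chosen with weight counts[k];
        # a brand-new cluster is opened with weight alpha.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)
        else:
            counts[k] += 1
        labels.append(k)
    return labels, counts


labels, counts = sample_crp_partition(n_nodes=50, alpha=1.0)
```

The number of clusters is not fixed in advance: it is read off from `len(counts)` after sampling, which is the sense in which model complexity is inferred rather than specified.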

1312.5889

Country: North America > United States (0.28)

Genre:

Research Report (0.50)
Instructional Material > Course Syllabus & Notes (0.46)
Overview (0.46)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Deleforge, Antoine, Forbes, Florence, Horaud, Radu

High-Dimensional Regression with Gaussian Mixtures and Partially-Latent Response Variables

arXiv.org Machine Learning, Dec-20-2013

In this work we address the problem of approximating high-dimensional data with a low-dimensional representation. We make the following contributions. We propose an inverse regression method which exchanges the roles of input and response, such that the low-dimensional variable becomes the regressor, and which is tractable. We introduce a mixture of locally-linear probabilistic mapping model that starts with estimating the parameters of inverse regression, and follows with inferring closed-form solutions for the forward parameters of the high-dimensional regression problem of interest. Moreover, we introduce a partially-latent paradigm, such that the vector-valued response variable is composed of both observed and latent entries, thus being able to deal with data contaminated by experimental artifacts that cannot be explained with noise models. The proposed probabilistic formulation could be viewed as a latent-variable augmentation of regression. We devise expectation-maximization (EM) procedures based on a data augmentation strategy which facilitates the maximum-likelihood search over the model parameters. We propose two augmentation schemes and we describe in detail the associated EM inference procedures that may well be viewed as generalizations of a number of EM regression, dimension reduction, and factor analysis algorithms. The proposed framework is validated with both synthetic and real data. We provide experimental evidence that our method outperforms several existing regression techniques.

artificial intelligence, machine learning, regression

doi: 10.1007/s11222-014-9461-5

1308.2302

Country:

Europe (1.00)
North America > United States > California (0.28)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.66)

Clémençon, Stéphan, Depecker, Marine

Functional Bipartite Ranking: a Wavelet-Based Filtering Approach

arXiv.org Machine Learning, Dec-18-2013

Functional Classification, i.e. the binary classification problem in which the input observation X = (X(t)) takes the form of a (possibly sampled) random curve/function and the output variable Y ∈ {-1, +1} is a binary label, has received a good deal of attention in the machine-learning literature in the past few years, see [1] or [2]. In contrast, Bipartite Ranking, sometimes termed Nonparametric Scoring, has never been tackled in a functional framework, except from the restrictive angle of Functional Logistic Regression, see [3] or [4] for instance. This global learning task consists in ordering all possible input observations X so that positive ones appear at the top of the list with the highest probability. This predictive problem, which can be cast in terms of ROC curve optimization (see [5]), covers a wide variety of applications, ranging from anomaly detection in signal processing to the automatic design of diagnosis tools in medicine, credit scoring in mathematical finance, and the design of search engines in information retrieval. Functional versions of many popular approaches for classification have been developed, relying in general on a preliminary finite-dimensional representation/projection of the input data.

artificial intelligence, coefficient, machine learning

1312.5066

Country:

Europe (0.28)
North America > United States (0.14)

Genre:

Research Report > New Finding (0.66)
Research Report > Experimental Study (0.66)

Industry: Health & Medicine (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)

Fuchs, Mathias, Hornung, Roman, De Bin, Riccardo, Boulesteix, Anne-Laure

A U-statistic estimator for the variance of resampling-based error estimators

arXiv.org Machine Learning, Dec-18-2013

The goal of supervised statistical learning is to develop prediction rules taking the values of predictor variables as input and returning a predicted value of the response variable. A prediction rule is typically learnt by applying a learning algorithm M to a so-called learning data set. A typical example in biomedical research is the prediction of patient outcome (e.g. ...). Practitioners are usually interested in the accuracy of the prediction rule learnt from their data set to predict future patients, while methodological researchers rather want to know whether the learning algorithm is good at learning accurate prediction rules for different data sets drawn from a distribution of interest. The first perspective is called "conditional" (since it refers to a specific data set) while the latter, which we take in this paper, is denoted as "unconditional". If the data set is very large, one can observe independent realizations of estimators of the unconditional error rates and use them for a paired t-test (see Section 2.3). In practice, however, huge data sets are rarely available. Prediction errors are thus usually estimated by resampling procedures consisting of splitting the available data set into learning and test sets a large number of times and averaging the estimated error over these iterations.

artificial intelligence, estimator, machine learning
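The resampling procedure described in the abstract above (repeatedly splitting the data into learning and test sets and averaging the test error) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `resampled_error`, `learn_majority`, and `zero_one_loss`, the toy data, and the majority-vote learner are all hypothetical. It covers only the error estimator itself, not the U-statistic variance estimator named in the title.

```python
import random


def resampled_error(data, learn, loss, n_learn, n_iter=200, seed=0):
    """Average test error over repeated random learning/test splits."""
    rng = random.Random(seed)
    errors = []
    for _ in range(n_iter):
        shuffled = data[:]
        rng.shuffle(shuffled)
        learning, test = shuffled[:n_learn], shuffled[n_learn:]
        rule = learn(learning)  # fit a prediction rule on the learning set
        errors.append(sum(loss(rule, x, y) for x, y in test) / len(test))
    return sum(errors) / len(errors)


def learn_majority(learning_set):
    """Toy learning algorithm (assumption): always predict the majority label."""
    ones = sum(y for _, y in learning_set)
    label = 1 if 2 * ones >= len(learning_set) else 0
    return lambda x: label


def zero_one_loss(rule, x, y):
    """Return 1.0 for a misclassification and 0.0 otherwise."""
    return 0.0 if rule(x) == y else 1.0


# Toy data set (assumption): 40 observations, 30 with label 1 and 10 with label 0.
toy_data = [(i, 1 if i % 4 else 0) for i in range(40)]
estimate = resampled_error(toy_data, learn_majority, zero_one_loss, n_learn=30)
```

Because the splits overlap, the per-iteration errors are not independent, which is why estimating the variance of such an averaged estimator needs more care than a naive sample variance.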

1310.8203

Genre: Research Report > Experimental Study (0.66)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Freris, Nikolaos M., Öçal, Orhan, Vetterli, Martin

Recursive Compressed Sensing

We introduce a recursive algorithm for performing compressed sensing on streaming data. The approach consists of a) recursive encoding, where we sample the input stream via overlapping windowing and make use of the previous measurement in obtaining the next one, and b) recursive decoding, where the signal estimate from the previous window is utilized in order to achieve faster convergence in an iterative optimization scheme applied to decode the new one. To remove estimation bias, a two-step estimation procedure is proposed comprising support set detection and signal amplitude estimation. Estimation accuracy is enhanced by a non-linear voting method and averaging estimates over multiple windows. We analyze the computational complexity and estimation error, and show that the normalized error variance asymptotically goes to zero for sublinear sparsity. Our simulation results show speed up of an order of magnitude over traditional CS, while obtaining significantly lower reconstruction error under mild conditions on the signal magnitudes and the noise level.

artificial intelligence, machine learning, programming language

1312.4895

Country:

North America > United States (0.28)
Europe (0.28)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Frigola, Roger, Lindsten, Fredrik, Schön, Thomas B., Rasmussen, Carl E.

Identification of Gaussian Process State-Space Models with Particle Stochastic Approximation EM

Dept. of Information Technology, Uppsala University, Sweden.

Abstract: Gaussian process state-space models (GP-SSMs) are a very flexible family of models of nonlinear dynamical systems. They comprise a Bayesian nonparametric representation of the dynamics of the system and additional (hyper-)parameters governing the properties of this nonparametric representation. The Bayesian formalism enables systematic reasoning about the uncertainty in the system dynamics. We present an approach to maximum likelihood identification of the parameters in GP-SSMs, while retaining the full nonparametric description of the dynamics. The method is based on a stochastic approximation version of the EM algorithm that employs recent developments in particle Markov chain Monte Carlo for efficient identification.

Introduction: Inspired by recent developments in robotics and machine learning, we aim at constructing models of nonlinear dynamical systems capable of quantifying the uncertainty in their predictions.

artificial intelligence, gp-ssm, machine learning

1312.4852

Country:

Europe > United Kingdom > England (0.28)
Europe > Sweden > Uppsala County > Uppsala (0.24)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.50)

The Bernstein Function: A Unifying Framework of Nonconvex Penalization in Sparse Estimation

Zhang, Zhihua

In this paper we study nonconvex penalization using Bernstein functions. Since the Bernstein function is concave and nonsmooth at the origin, it can induce a class of nonconvex functions for high-dimensional sparse estimation problems. We derive a threshold function based on the Bernstein penalty and give its mathematical properties in sparsity modeling. We show that a coordinate descent algorithm is especially appropriate for penalized regression problems with the Bernstein penalty. Additionally, we prove that the Bernstein function can be defined as the concave conjugate of a $\varphi$-divergence and develop a conjugate maximization algorithm for finding the sparse solution. Finally, we particularly exemplify a family of Bernstein nonconvex penalties based on a generalized Gamma measure and conduct empirical analysis for this family.

artificial intelligence, bernstein function, machine learning

1312.4719

Country:

North America > United States (0.46)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

The Matrix Ridge Approximation: Algorithms and Applications

Zhang, Zhihua

We are concerned with an approximation problem for a symmetric positive semidefinite matrix due to motivation from a class of nonlinear machine learning methods. We discuss an approximation approach that we call matrix ridge approximation. In particular, we define the matrix ridge approximation as an incomplete matrix factorization plus a ridge term. Moreover, we present probabilistic interpretations using a normal latent variable model and a Wishart model for this approximation approach. The idea behind the latent variable model in turn leads us to an efficient EM iterative method for handling the matrix ridge approximation problem. Finally, we illustrate the applications of the approximation approach in multivariate data analysis. Empirical studies in spectral clustering and Gaussian process regression show that the matrix ridge approximation with the EM iteration is potentially useful.

approximation, artificial intelligence, machine learning

1312.4717

Country: Asia (0.28)

Genre: Research Report (0.63)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.87)

Frigola, Roger, Lindsten, Fredrik, Schön, Thomas B., Rasmussen, Carl E.

Bayesian Inference and Learning in Gaussian Process State-Space Models with Particle MCMC

State-space models are successfully used in many areas of science, engineering and economics to model time series and dynamical systems. We present a fully Bayesian approach to inference and learning (i.e. state estimation and system identification) in nonlinear nonparametric state-space models. We place a Gaussian process prior over the state transition dynamics, resulting in a flexible model able to capture complex dynamical phenomena. To enable efficient inference, we marginalize over the transition dynamics function and infer directly the joint smoothing distribution using specially tailored Particle Markov Chain Monte Carlo samplers. Once a sample from the smoothing distribution is computed, the state transition predictive distribution can be formulated analytically. Our approach preserves the full nonparametric expressivity of the model and can make use of sparse Gaussian processes to greatly reduce computational complexity.

artificial intelligence, machine learning, trajectory

1306.2861

Country: Europe > United Kingdom > England (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.48)