AITopics

Topic models, such as latent Dirichlet allocation (LDA), can be useful tools for the statistical analysis of document collections and other discrete data.The LDA model assumes that the words of each document arise from a mixture of topics, each of which is a distribution over the vocabulary. Alimitation of LDA is the inability to model topic correlation even though, for example, a document about genetics is more likely to also be about disease than x-ray astronomy. This limitation stems from the use of the Dirichlet distribution to model the variability among the topic proportions. In this paper we develop the correlated topic model (CTM), where the topic proportions exhibit correlation via the logistic normal distribution [1]. We derive a mean-field variational inference algorithm forapproximate posterior inference in this model, which is complicated bythe fact that the logistic normal is not conjugate to the multinomial. The CTM gives a better fit than LDA on a collection of OCRed articles from the journal Science. Furthermore, the CTM provides a natural wayof visualizing and exploring this and other unstructured data sets.

equation, health & medicine, neurology, (20 more...)

Country:

North America > United States (0.28)
North America > Canada > Ontario > Toronto (0.14)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology: Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)

Lee, Jung Hoon, Ma, Xiaolong, Likharev, Konstantin K.

CMOL CrossNets: Possible Neuromorphic Nanoelectronic Circuits

Hybrid "CMOL" integrated circuits, combining CMOS subsystem with nanowire crossbars and simple two-terminal nanodevices, promise to extend the exponential Moore-Law development of microelectronics into the sub-10-nm range. We are developing neuromorphic network ("CrossNet") architectures for this future technology, in which neural cell bodies are implemented in CMOS, nanowires are used as axons and dendrites, while nanodevices (bistable latching switches) are used as elementary synapses. We have shown how CrossNets may be trained to perform pattern recovery and classification despite the limitations imposed by the CMOL hardware.

artificial intelligence, crossnet, neural network, (17 more...)

Country: North America > United States (0.14)

Industry: Semiconductors & Electronics (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

An Approximate Inference Approach for the PCA Reconstruction Error

Opper, Manfred

The problem of computing a resample estimate for the reconstruction error in PCA is reformulated as an inference problem with the help of the replica method. Using the expectation consistent (EC) approximation, theintractable inference problem can be solved efficiently using only two variational parameters. A perturbative correction to the result is computed and an alternative simplified derivation is also presented.

approximation, artificial intelligence, machine learning, (16 more...)

Country: Europe > United Kingdom (0.14)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.84)

Tensor Subspace Analysis

He, Xiaofei, Cai, Deng, Niyogi, Partha

Previous work has demonstrated that the image variations of many objects (human faces in particular) under variable lighting can be effectively modeled by low dimensional linear spaces.

algorithm, artificial intelligence, machine learning, (15 more...)

Country: North America > United States > Illinois (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.98)

Silva, Jorge, Marques, Jorge, Lemos, João

Selecting Landmark Points for Sparse Manifold Learning

There has been a surge of interest in learning nonlinear manifold models to approximate high-dimensional data. Both for computational complexity reasonsand for generalization capability, sparsity is a desired feature in such models. This usually means dimensionality reduction, which naturally implies estimating the intrinsic dimension, but it can also mean selecting a subset of the data to use as landmarks, which is especially important becausemany existing algorithms have quadratic complexity in the number of observations.

artificial intelligence, landmark, machine learning, (17 more...)

Country: Europe > Portugal (0.15)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)

Maximum Margin Semi-Supervised Learning for Structured Variables

Altun, Y., McAllester, D., Belkin, M.

Many real-world classification problems involve the prediction of multiple interdependent variables forming some structural dependency. Recentprogress in machine learning has mainly focused on supervised classification of such structured variables. In this paper, we investigate structured classification in a semi-supervised setting. We present a discriminative approach that utilizes the intrinsic geometry ofinput patterns revealed by unlabeled data points and we derive a maximum-margin formulation of semi-supervised learning for structured variables.

artificial intelligence, inductive learning, kernel, (17 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.89)
(2 more...)

Malioutov, Dmitry, Willsky, Alan S., Johnson, Jason K.

Walk-Sum Interpretation and Analysis of Gaussian Belief Propagation

This paper presents a new framework based on walks in a graph for analysis andinference in Gaussian graphical models. The key idea is to decompose correlationsbetween variables as a sum over all walks between those variables in the graph. The weight of each walk is given by a product of edgewise partial correlations. We provide a walk-sum interpretation ofGaussian belief propagation in trees and of the approximate method of loopy belief propagation in graphs with cycles.

artificial intelligence, belief revision, computation tree, (17 more...)

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Belief Revision (0.83)
Information Technology > Artificial Intelligence > Machine Learning (0.67)

Yu, Byron M., Afshar, Afsheen, Santhanam, Gopal, Ryu, Stephen I., Shenoy, Krishna V., Sahani, Maneesh

Extracting Dynamical Structure Embedded in Neural Activity

Spiking activity from neurophysiological experiments often exhibits dynamics beyondthat driven by external stimulation, presumably reflecting the extensive recurrence of neural circuitry. Characterizing these dynamics may reveal important features of neural computation, particularly duringinternally-driven cognitive operations. For example, the activity of premotor cortex (PMd) neurons during an instructed delay periodseparating movement-target specification and a movementinitiation cueis believed to be involved in motor planning. We show that the dynamics underlying this activity can be captured by a lowdimensional non-lineardynamical systems model, with underlying recurrent structure and stochastic point-process output.

health & medicine, neurology, trajectory, (18 more...)

Country: North America > United States > California > Santa Clara County (0.14)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.94)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.69)

Wakin, Michael B., Duarte, Marco F., Sarvotham, Shriram, Baron, Dror, Baraniuk, Richard G.

Recovery of Jointly Sparse Signals from Few Random Projections

Compressed sensing is an emerging field based on the revelation that a small group of linear projections of a sparse signal contains enough information for reconstruction. Inthis paper we introduce a new theory for distributed compressed sensing (DCS) that enables new distributed coding algorithms for multi-signal ensembles that exploit both intra-and inter-signal correlation structures. The DCS theory rests on a new concept that we term the joint sparsity of a signal ensemble. We study three simple models for jointly sparse signals, propose algorithms for joint recovery ofmultiple signals from incoherent projections, and characterize theoretically and empirically the number of measurements per sensor required for accurate reconstruction. Insome sense DCS is a framework for distributed compression of sources with memory, which has remained a challenging problem in information theory for some time. DCS is immediately applicable to a range of problems in sensor networks and arrays.

algorithm, artificial intelligence, survey article, (16 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)

Cesa-bianchi, Nicolò, Gentile, Claudio

Improved risk tail bounds for on-line algorithms

We prove the strongest known bound for the risk of hypotheses selected from the ensemble generated by running a learning algorithm incrementally onthe training data. Our result is based on proof techniques that are remarkably different from the standard risk analysis based on uniform convergence arguments.

artificial intelligence, hypothesis, machine learning, (18 more...)

Country: Europe > Italy (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)