AITopics

Complexity theory of circuits strongly suggests that deep architectures can be much more efficient (sometimes exponentially) than shallow architectures, in terms of computational elements required to represent some functions. Deep multi-layer neural networks have many levels of non-linearities allowing them to compactly represent highly nonlinear and highly-varying functions. However, until recently it was not clear how to train such deep networks, since gradient-based optimization starting from random initialization appears to often get stuck in poor solutions.

algorithm, artificial intelligence, machine learning, (20 more...)

Country: North America > United States (0.14)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

Braun, Mikio L., Müller, Klaus-Robert, Buhmann, Joachim M.

Denoising and Dimension Reduction in Feature Space

We show that the relevant information about a classification problem in feature space is contained up to negligible error in a finite number of leading kernel PCA components if the kernel matches the underlying learning problem. Thus, kernels notonly transform data sets such that good generalization can be achieved even by linear discriminant functions, but this transformation is also performed in a manner which makes economic use of feature space dimensions. In the best case, kernels provide efficient implicit representations of the data to perform classification. Practically,we propose an algorithm which enables us to recover the subspace and dimensionality relevant for good classification. Our algorithm can therefore be applied (1) to analyze the interplay of data set and kernel in a geometric fashion,(2) to help in model selection, and to (3) de-noise in feature space in order to yield better classification results.

artificial intelligence, kernel pca component, machine learning, (16 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Lukšys, Gediminas, Knüsel, Jérémie, Sheynikhovich, Denis, Sandi, Carmen, Gerstner, Wulfram

Effects of Stress and Genotype on Meta-parameter Dynamics in Reinforcement Learning

Stress and genetic background regulate different aspects of behavioral learning through the action of stress hormones and neuromodulators.

experiment, machine learning, reinforcement learning, (15 more...)

Country: Europe (0.28)

Genre: Research Report (0.48)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.65)

Rubinstein, Benjamin I., Bartlett, Peter L., Rubinstein, J. H.

Shifting, One-Inclusion Mistake Bounds and Tight Multiclass Expected Risk Bounds

Under the prediction model of learning, a prediction strategy is presented with an i.i.d.

artificial intelligence, machine learning, prediction strategy, (15 more...)

Country: North America > United States > California > Alameda County > Berkeley (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.91)

A Humanlike Predictor of Facial Attractiveness

Kagian, Amit, Dror, Gideon, Leyvand, Tommer, Cohen-or, Daniel, Ruppin, Eytan

This work presents a method for estimating human facial attractiveness, based on supervised learning techniques. Numerous facial features that describe facial geometry, color and texture, combined with an average human attractiveness score for each facial image, are used to train various predictors. Facial attractiveness ratings produced by the final predictor are found to be highly correlated with human ratings, markedly improving previous machine learning achievements. Simulated psychophysical experiments with virtually manipulated images reveal preferences in the machine's judgments which are remarkably similar to those of humans. These experiments shed new light on existing theories of facial attractiveness such as the averageness, smoothness and symmetry hypotheses. It is intriguing to find that a machine trained explicitly to capture an operational performance criteria such as attractiveness rating, implicitly captures basic human psychophysical biases characterizing the perception of facial attractiveness in general.

artificial intelligence, attractiveness, machine learning, (18 more...)

Country:

North America > United States (0.29)
Europe > Austria (0.28)
Asia > Middle East > Israel (0.15)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.68)

In-Network PCA and Anomaly Detection

Huang, Ling, Nguyen, XuanLong, Garofalakis, Minos, Jordan, Michael I., Joseph, Anthony, Taft, Nina

We consider the problem of network anomaly detection in large distributed systems. In this setting, Principal Component Analysis (PCA) has been proposed as a method for discovering anomaliesby continuously tracking the projection of the data onto a residual subspace. This method was shown to work well empirically in highly aggregated networks, that is, those with a limited number of large nodes and at coarse time scales.

coordinator, data mining, machine learning, (17 more...)

Country: North America > United States > California > Alameda County > Berkeley (0.15)

Industry: Information Technology > Security & Privacy (0.94)

Technology:

Information Technology > Data Science > Data Mining > Anomaly Detection (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Richter, Silvia, Aberdeen, Douglas, Yu, Jin

Natural Actor-Critic for Road Traffic Optimisation

Current road-traffic optimisation practice around the world is a combination of hand tuned policies with a small degree of automatic adaption. Even state-ofthe-art researchcontrollers need good models of the road traffic, which cannot be obtained directly from existing sensors. We use a policy-gradient reinforcement learningapproach to directly optimise the traffic signals, mapping currently deployed sensor observations to control signals. Our trained controllers are (theoretically) compatiblewith the traffic system used in Sydney and many other cities around the world. We apply two policy-gradient methods: (1) the recent natural actor-critic algorithm, and (2) a vanilla policy-gradient algorithm for comparison. Along the way we extend natural-actor critic approaches to work for distributed and online infinite-horizon problems.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Country:

Oceania > Australia (0.29)
North America > United States (0.28)

Industry:

Transportation > Infrastructure & Services (1.00)
Transportation > Ground > Road (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.47)

Valizadegan, Hamed, Jin, Rong

Generalized Maximum Margin Clustering and Unsupervised Kernel Learning

Maximum margin clustering was proposed lately and has shown promising performance in recent studies [1, 2]. It extends the theory of support vector machineto unsupervised learning. Despite its good performance, there are three major problems with maximum margin clustering that question its efficiency for real-world applications. First, it is computationally expensive anddifficult to scale to large-scale datasets because the number of parameters in maximum margin clustering is quadratic in the number of examples. Second, it requires data preprocessing to ensure that any clustering boundarywill pass through the origins, which makes it unsuitable for clustering unbalanced dataset. Third, it is sensitive to the choice of kernel functions, and requires external procedure to determine the appropriate values for the parameters of kernel functions. In this paper, we propose "generalized maximum margin clustering" framework that addresses the above three problems simultaneously.

artificial intelligence, machine learning, maximum margin, (16 more...)

Country: North America > United States > Michigan > Ingham County (0.14)

Genre: Research Report (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.33)

Zhou, Dengyong, Huang, Jiayuan, Schölkopf, Bernhard

Learning with Hypergraphs: Clustering, Classification, and Embedding

We usually endow the investigated objects with pairwise relationships, which can be illustrated as graphs. In many real-world problems, however, relationships among the objects of our interest are more complex than pairwise. Naivelysqueezing the complex relationships into pairwise ones will inevitably lead to loss of information which can be expected valuable for our learning tasks however. Therefore we consider using hypergraphs instead tocompletely represent complex relationships among the objects of our interest, and thus the problem of learning with hypergraphs arises. Our main contribution in this paper is to generalize the powerful methodology of spectral clustering which originally operates on undirected graphs to hypergraphs, andfurther develop algorithms for hypergraph embedding and transductive classification on the basis of the spectral hypergraph clustering approach.Our experiments on a number of benchmarks showed the advantages of hypergraphs over usual graphs.

artificial intelligence, hypergraph, machine learning, (16 more...)

Country: North America > United States (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Roy, Daniel M., Kemp, Charles, Mansinghka, Vikash K., Tenenbaum, Joshua B.

Learning annotated hierarchies from relational data

The objects in many real-world domains can be organized into hierarchies, where each internal node picks out a category of objects. Given a collection of features andrelations defined over a set of objects, an annotated hierarchy includes a specification of the categories that are most useful for describing each individual feature and relation. We define a generative model for annotated hierarchies and the features and relations that they describe, and develop a Markov chain Monte Carlo scheme for learning annotated hierarchies. We show that our model discovers interpretablestructure in several real-world data sets.

artificial intelligence, machine learning, partition, (19 more...)

Country: North America > United States (0.28)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.68)