AITopics

Country:

North America > United States (0.04)
Europe > United Kingdom (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.62)

Active Learning in Multilayer Perceptrons

Fukumizu, Kenji

We propose an active learning method with hidden-unit reduction.

algorithm, information matrix, learning, (11 more...)

Country:

Asia > Japan > Honshū > Kantō > Kanagawa Prefecture > Yokohama (0.05)
North America > United States > California > San Mateo County > San Mateo (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.91)

Marchand, Mario, Hadjifaradji, Saeed

Strong Unimodality and Exact Learning of Constant Depth µ-Perceptron Networks

strong unimodality and exact learning

Abstract Missing

Country:

Europe > United Kingdom (0.04)
Asia > Middle East > UAE (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.40)

Optimization Principles for the Neural Code

DeWeese, Michael

Recent experiments show that the neural codes at work in a wide range of creatures share some common features. At first sight, these observations seem unrelated. However, we show that these features arise naturally in a linear filtered threshold crossing (LFTC) model when we set the threshold to maximize the transmitted information. This maximization process requires neural adaptation to not only the DC signal level, as in conventional light and dark adaptation, but also to the statistical structure of the signal and noise distributions. We also present a new approach for calculating the mutual information between a neuron's output spike train and any aspect of its input signal which does not require reconstruction of the input signal. This formulation is valid provided the correlations in the spike train are small, and we provide a procedure for checking this assumption. This paper is based on joint work (DeWeese [1], 1995). Preliminary results from the LFTC model appeared in a previous proceedings (DeWeese [2], 1995), and the conclusions we reached at that time have been reaffirmed by further analysis of the model.

information, noise, spike train, (16 more...)

Country:

North America > United States > New York (0.04)
North America > United States > California > San Diego County > La Jolla (0.04)

Genre: Research Report (0.48)

Technology: Information Technology > Artificial Intelligence (0.47)

Mansour, Yishay, Sahar, Sigal

Implementation Issues in the Fourier Transform Algorithm

Over the last few years the Fourier Transform (FT) representation of boolean functions has been an instrumental tool in the computational learning theory community. It has been used mainly to demonstrate the learnability of various classes of functions with respect to the uniform distribution.

algorithm, coefficient, hypothesis, (16 more...)

Country: Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.05)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.69)
Information Technology > Data Science > Data Quality > Data Transformation (0.63)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.55)

Coolen, A.C.C., Laughton, S. N., Sherrington, D.

Modern Analytic Techniques to Solve the Dynamics of Recurrent Neural Networks

We describe the use of modern analytical techniques in solving the dynamics of symmetric and nonsymmetric recurrent neural networks near saturation. These explicitly take into account the correlations between the post-synaptic potentials, and thereby allow for a reliable prediction of transients. 1 INTRODUCTION Recurrent neural networks have been rather popular in the physics community, because they lend themselves so naturally to analysis with tools from equilibrium statistical mechanics. This was the main theme of physicists between, say, 1985 and 1990. Less familiar to the neural network community is a subsequent wave of theoretical physical studies, dealing with the dynamics of symmetric and nonsymmetric recurrent networks. The strategy here is to try to describe the processes at a reduced level of an appropriate small set of dynamic macroscopic observables.

analytic technique, sherrington, simulation, (14 more...)

Country:

North America > United States > New Mexico > Los Alamos County > Los Alamos (0.05)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.05)
Europe > Netherlands > North Holland > Amsterdam (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.83)

Bohossian, Vasken, Bruck, Jehoshua

On Neural Networks with Minimal Weights

Linear threshold elements are the basic building blocks of artificial neural networks. A linear threshold element computes a function that is a sign of a weighted sum of the input variables. The weights are arbitrary integers; actually, they can be very big integers-exponential in the number of the input variables. However, in practice, it is difficult to implement big weights. In the present literature a distinction is made between the two extreme cases: linear threshold functions with polynomial-size weights as opposed to those with exponential-size weights.

linear threshold function, vector, weight vector, (14 more...)

Country:

North America > United States > California > Santa Clara County > Stanford (0.04)
North America > United States > California > Santa Clara County > San Jose (0.04)
North America > United States > California > Los Angeles County > Pasadena (0.04)
Europe > Switzerland > Vaud > Lausanne (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Snapp, Robert R., Xu, Tong

Estimating the Bayes Risk from Sample Data

In this setting, each pattern, represented as an n-dimensional feature vector, is associated with a discrete pattern class, or state of nature (Duda and Hart, 1973). Using available information, (e.g., a statistically representative set of labeled feature vectors

bayes risk, classification problem, classifier, (14 more...)

Country:

North America > United States > Vermont > Chittenden County > Burlington (0.14)
North America > United States > New York > New York County > New York City (0.05)
North America > United States > Texas (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.47)

Stable Dynamic Parameter Adaption

Rüger, Stefan M.

A stability criterion for dynamic parameter adaptation is given. In the case of the learning rate of backpropagation, a class of stable algorithms is presented and studied, including a convergence proof.

algorithm, backpropagation, gradient, (14 more...)

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
Europe > Germany > Berlin (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Koiran, Pascal, Sontag, Eduardo D.

Neural Networks with Quadratic VC Dimension

A set of labeled training samples is provided, and a network must be obtained which is then expected to correctly classify previously unseen inputs. In this context, a central problem is to estimate the amount of training data needed to guarantee satisfactory learning performance. To study this question, it is necessary to first formalize the notion of learning from examples. One such formalization is based on the paradigm of probably approximately correct (PAC) learning, due to Valiant (1984). In this framework, one starts by fitting some function /, chosen from a predetermined class F, to the given training data. The class F is often called the "hypothesis class", and for purposes of this discussion it will be assumed that the functions in F take binary values {O, I} and are defined on a common domain X.

architecture, dimension, vc dimension, (16 more...)