A Knowledge-Based Model of Geometry Learning
Towell, Geoffrey, Lehrer, Richard
We propose a model of the development of geometric reasoning in children that explicitly involves learning. The model uses a neural network that is initialized with an understanding of geometry similar to that of second-grade children. Through the presentation of a series of examples, the model is shown to develop an understanding of geometry similar to that of fifth-grade children who were trained using similar materials.
Object-Based Analog VLSI Vision Circuits
Koch, Christof, Mathur, Bimal, Liu, Shih-Chii, Harris, John G., Luo, Jin, Sivilotti, Massimo
We describe two successfully working analog VLSI vision circuits that move beyond pixel-based early vision algorithms. One circuit, implementing the dynamic wires model, provides for dedicated lines of communication among groups of pixels that share a common property. The chip uses the dynamic wires model to compute the arclength of visual contours. Another circuit labels all points inside a given contour with one voltage and all others with another voltage. Its behavior is very robust, since small breaks in contours are automatically sealed, providing for figure-ground segregation in a noisy environment. Both chips are implemented using networks of resistors and switches and represent a step towards object-level processing, since a single voltage value encodes the property of an ensemble of pixels.
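The chips themselves are analog hardware; as a loose software analogue for illustration only (not the circuit described in the paper), the figure-ground labeling can be sketched in Python: interior pixels receive one "voltage", exterior pixels another, and small breaks in the contour are sealed before filling.

# Hypothetical software analogue of the chip's figure-ground labeling:
# interior pixels get one "voltage" (1.0), exterior pixels another (0.0),
# and small breaks in the contour are sealed before filling.
import numpy as np
from scipy import ndimage

def label_figure_ground(contour, seal_iterations=1):
    """contour: 2-D boolean array marking contour pixels."""
    # Seal small gaps by dilating the contour slightly.
    sealed = ndimage.binary_dilation(contour, iterations=seal_iterations)
    # Fill the region enclosed by the (sealed) contour.
    interior = ndimage.binary_fill_holes(sealed)
    # Return 1.0 "volts" inside, 0.0 outside.
    return interior.astype(float)

# Example: a square contour with a one-pixel break still yields a filled square.
img = np.zeros((8, 8), dtype=bool)
img[1, 1:7] = img[6, 1:7] = img[1:7, 1] = img[1:7, 6] = True
img[1, 3] = False  # small break in the contour
print(label_figure_ground(img))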
Unsupervised Discrimination of Clustered Data via Optimization of Binary Information Gain
Schraudolph, Nicol N., Sejnowski, Terrence J.
We present the information-theoretic derivation of a learning algorithm that clusters unlabelled data with linear discriminants. In contrast to methods that try to preserve information about the input patterns, we maximize the information gained from observing the output of robust binary discriminators implemented with sigmoid nodes. We derive a local weight adaptation rule via gradient ascent in this objective, demonstrate its dynamics on some simple data sets, relate our approach to previous work, and suggest directions in which it may be extended.
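The abstract does not state the update rule itself; a minimal sketch of one plausible reading, assuming a single sigmoid discriminant whose output is pushed away from a running estimate of its mean (so that the binarized output becomes maximally informative about the two clusters), might look like the following. The constants and the exact form of the update are illustrative, not the paper's.

# Hedged sketch (not necessarily the paper's exact rule): one linear
# discriminant with a sigmoid output, adapted by a local gradient-like
# rule so that its binarized output splits the unlabelled clusters.
import numpy as np

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-2, 0.5, (100, 2)),   # two unlabelled clusters
               rng.normal(+2, 0.5, (100, 2))])

w = rng.normal(0, 0.1, 2)
b = 0.0
y_bar = 0.5            # running estimate of the unit's mean output
eta, decay = 0.1, 0.9

for epoch in range(50):
    for x in rng.permutation(X):
        y = 1.0 / (1.0 + np.exp(-(w @ x + b)))
        y_bar = decay * y_bar + (1 - decay) * y
        # Assumed update: push the output away from its running mean,
        # scaled by the sigmoid slope (a purely local rule).
        delta = y * (1 - y) * (y - y_bar)
        w += eta * delta * x
        b += eta * delta

# After training, the sign of (w @ x + b) separates the two clusters.
print(np.mean((X @ w + b > 0)[:100]), np.mean((X @ w + b > 0)[100:]))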
A Note on Learning Vector Quantization
Sa, Virginia R. de, Ballard, Dana H.
Vector Quantization is useful for data compression. Competitive Learning, which minimizes reconstruction error, is an appropriate algorithm for vector quantization of unlabelled data. Vector quantization of labelled data for classification has a different objective, to minimize the number of misclassifications, and a different algorithm is appropriate. We show that a variant of Kohonen's LVQ2.1 algorithm can be seen as a multiclass extension of an algorithm which in a restricted two-class case can be proven to converge to the Bayes optimal classification boundary. We compare the performance of the LVQ2.1 algorithm to that of a modified version having a decreasing window and normalized step size, on a ten-class vowel classification problem.
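For reference, the standard LVQ2.1 update that the paper builds on can be sketched as follows; the modified decreasing-window, normalized-step variant compared in the paper is not reproduced here, and the parameter values are illustrative.

# Minimal sketch of the standard LVQ2.1 update (Kohonen): the two nearest
# prototypes are adapted only when exactly one has the correct class and
# the input falls inside a window around the decision border.
import numpy as np

def lvq21_step(codebook, labels, x, y, alpha=0.05, window=0.3):
    """codebook: (k, d) prototype vectors; labels: (k,) their classes;
    x: input vector; y: its class label."""
    d = np.linalg.norm(codebook - x, axis=1)
    i, j = np.argsort(d)[:2]                 # indices of the two nearest prototypes
    s = (1 - window) / (1 + window)
    in_window = min(d[i] / d[j], d[j] / d[i]) > s
    correct_i, correct_j = labels[i] == y, labels[j] == y
    if in_window and (correct_i != correct_j):
        if correct_i:
            codebook[i] += alpha * (x - codebook[i])   # pull correct prototype closer
            codebook[j] -= alpha * (x - codebook[j])   # push incorrect prototype away
        else:
            codebook[j] += alpha * (x - codebook[j])
            codebook[i] -= alpha * (x - codebook[i])
    return codebook

# Illustrative usage with two prototypes of different classes.
codebook = np.array([[0.0, 0.0], [1.0, 1.0]])
labels = np.array([0, 1])
print(lvq21_step(codebook, labels, x=np.array([0.6, 0.6]), y=0))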
Directional-Unit Boltzmann Machines
Zemel, Richard S., Williams, Christopher K. I., Mozer, Michael C.
We present a general formulation for a network of stochastic directional units. This formulation is an extension of the Boltzmann machine in which the units are not binary, but take on values in a cyclic range, between 0 and 2π radians. The conditional distribution of a unit's stochastic state is a circular version of the Gaussian probability distribution, known as the von Mises distribution. This combination of a value and a certainty provides additional representational power in a unit. Many kinds of information can naturally be represented in terms of angular, or directional, variables. A circular range forms a suitable representation for explicitly directional information, such as wind direction, as well as for information where the underlying range is periodic, such as days of the week or months of the year.
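A single directional unit's stochastic state can be illustrated as a draw from a von Mises distribution, with the mean encoding the represented direction and the concentration encoding the certainty. The following minimal sketch uses NumPy's von Mises sampler (which returns angles in [-π, π) rather than [0, 2π)); the parameter values are arbitrary.

# Hedged sketch: one directional unit whose state is an angle drawn from a
# von Mises distribution.  The mean mu encodes the represented direction
# and the concentration kappa encodes the unit's certainty (kappa -> 0
# gives a nearly uniform angle, large kappa a sharply peaked one).
import numpy as np

rng = np.random.default_rng(0)

def sample_directional_unit(mu, kappa, n=5):
    """Return n stochastic states (angles in [-pi, pi)) of one unit."""
    return rng.vonmises(mu, kappa, size=n)

print(sample_directional_unit(mu=np.pi / 2, kappa=0.1))    # very uncertain unit
print(sample_directional_unit(mu=np.pi / 2, kappa=20.0))   # confident "north"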
Weight Space Probability Densities in Stochastic Learning: II. Transients and Basin Hopping Times
Orr, Genevieve B., Leen, Todd K.
In stochastic learning, weights are random variables whose time evolution is governed by a Markov process. We summarize the theory of the time evolution of the weight space probability density P, and give graphical examples of the time evolution that contrast the behavior of stochastic learning with true gradient descent (batch learning). Finally, we use the formalism to obtain predictions of the time required for noise-induced hopping between basins of different optima. We compare the theoretical predictions with simulations of large ensembles of networks for simple problems in supervised and unsupervised learning. Despite the recent application of convergence theorems from stochastic approximation theory to neural network learning (Oja 1982, White 1989), there remain outstanding questions about the search dynamics in stochastic learning.
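A toy illustration of noise-induced basin hopping (not the paper's formalism): an ensemble of weights performing stochastic gradient descent on a double-well cost, compared with noiseless batch descent from the same starting points. Cost, noise level, and step size are assumptions chosen only to make the hopping visible.

# Illustration only: ensemble of weights doing noisy gradient descent on the
# double-well cost C(w) = (w**2 - 1)**2.  Gradient noise lets individual
# weights hop between the basins at w = -1 and w = +1, whereas true batch
# gradient descent from the same starting points stays in its initial basin.
import numpy as np

rng = np.random.default_rng(0)

def grad(w):
    return 4 * w * (w**2 - 1)          # dC/dw for the double-well cost

n_nets, steps, eta, noise = 500, 5000, 0.1, 2.0
w_sgd = np.full(n_nets, -1.0)          # whole ensemble starts in the left basin
w_batch = np.full(n_nets, -1.0)

for _ in range(steps):
    w_sgd -= eta * (grad(w_sgd) + noise * rng.standard_normal(n_nets))
    w_batch -= eta * grad(w_batch)     # no noise: deterministic descent

print("fraction hopped to the right basin (stochastic):", np.mean(w_sgd > 0))
print("fraction hopped to the right basin (batch):     ", np.mean(w_batch > 0))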
Global Regularization of Inverse Kinematics for Redundant Manipulators
DeMers, David, Kreutz-Delgado, Kenneth
When m < n, we say that the manipulator has redundant degrees-of-freedom (dof). The inverse kinematics problem is the following: given a desired workspace location x, find joint variables θ such that f(θ) = x. Even when the forward kinematics is known, the inverse kinematics for a manipulator is not generically solvable in closed form (Craig, 1986).
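As a minimal numerical illustration of the problem statement only (not the paper's global regularization method), the following sketch inverts the forward kinematics of a hypothetical redundant 3-link planar arm by iterating with the Jacobian pseudo-inverse; link lengths, target, and starting configuration are assumptions.

# Illustrative only: a planar 3-link arm has n = 3 joint angles but an
# m = 2 dimensional workspace, so it is redundant.  The pseudo-inverse
# iteration below finds *one* theta with f(theta) = x; it does not give
# the globally consistent choice among solutions that the paper addresses.
import numpy as np

lengths = np.array([1.0, 1.0, 1.0])    # assumed link lengths

def f(theta):
    """Forward kinematics: joint angles -> end-effector position."""
    angles = np.cumsum(theta)
    return np.array([np.sum(lengths * np.cos(angles)),
                     np.sum(lengths * np.sin(angles))])

def jacobian(theta):
    angles = np.cumsum(theta)
    J = np.zeros((2, 3))
    for j in range(3):
        J[0, j] = -np.sum(lengths[j:] * np.sin(angles[j:]))
        J[1, j] =  np.sum(lengths[j:] * np.cos(angles[j:]))
    return J

def inverse_kinematics(x_target, theta=np.array([0.5, 0.5, 0.5]), iters=200):
    for _ in range(iters):
        error = x_target - f(theta)
        # Pseudo-inverse step: the minimum-norm joint update among the
        # infinitely many that reduce the workspace error.
        theta = theta + np.linalg.pinv(jacobian(theta)) @ error
    return theta

theta = inverse_kinematics(np.array([1.5, 1.0]))
print(f(theta))     # close to the requested workspace location (1.5, 1.0)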
Automatic Learning Rate Maximization by On-Line Estimation of the Hessian's Eigenvectors
LeCun, Yann, Simard, Patrice Y., Pearlmutter, Barak
We propose a very simple and well-principled way of computing the optimal step size in gradient descent algorithms. The online version is very efficient computationally, and is applicable to large backpropagation networks trained on large data sets. The main ingredient is a technique for estimating the principal eigenvalue(s) and eigenvector(s) of the objective function's second derivative matrix (Hessian), which does not even require calculating the Hessian. Several other applications of this technique are proposed for speeding up learning, or for eliminating useless parameters. Choosing the appropriate learning rate, or step size, in a gradient descent procedure such as backpropagation is simultaneously one of the most crucial and most expert-intensive parts of neural-network learning. We propose a method for computing the best step size which is well-principled, simple, very cheap computationally, and, most of all, applicable to online training with large networks and data sets.
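The core idea of estimating the Hessian's principal eigenvalue without ever forming the Hessian can be sketched with power iteration on finite-difference Hessian-vector products. The toy quadratic loss, constants, and function names below are illustrative assumptions, not the paper's exact online procedure.

# Hedged sketch: estimate the largest eigenvalue of the loss Hessian using
# only gradient evaluations (finite-difference Hessian-vector products),
# then take the step size as roughly 1 / lambda_max.
import numpy as np

def loss_grad(w):
    # Toy quadratic loss 0.5 * w^T A w with known Hessian A (for checking).
    A = np.array([[3.0, 1.0], [1.0, 2.0]])
    return A @ w

def hessian_vector_product(grad_fn, w, v, eps=1e-5):
    # H v  ~=  (grad(w + eps*v) - grad(w)) / eps
    return (grad_fn(w + eps * v) - grad_fn(w)) / eps

def largest_eigenvalue(grad_fn, w, iters=50):
    v = np.random.default_rng(0).standard_normal(w.shape)
    for _ in range(iters):
        hv = hessian_vector_product(grad_fn, w, v)
        v = hv / np.linalg.norm(hv)          # power iteration
    return v @ hessian_vector_product(grad_fn, w, v)

w = np.zeros(2)
lam = largest_eigenvalue(loss_grad, w)
print("estimated lambda_max:", lam, " -> suggested step size ~", 1.0 / lam)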
Computing with Almost Optimal Size Neural Networks
Siu, Kai-Yeung, Roychowdhury, Vwani, Kailath, Thomas
Artificial neural networks are comprised of an interconnected collection of certain nonlinear devices; examples of commonly used devices include linear threshold elements, sigmoidal elements and radial-basis elements. We employ results from harmonic analysis and the theory of rational approximation to obtain almost tight lower bounds on the size (i.e., the number of elements) of neural networks.