AITopics

In many machine learning applications, one has access, not only to training data, but also to some high-level a priori knowledge about the desired behavior of the system. For example, it is known in advance that the output of a character recognizer should be invariant with respect to small spatial distortions of the input images (translations, rotations, scale changes, etcetera). We have implemented a scheme that allows a network to learn the derivative of its outputs with respect to distortion operators of our choosing. This not only reduces the learning time and the amount of training data, but also provides a powerful language for specifying what generalizations we wish the network to perform. 1 INTRODUCTION In machine learning, one very often knows more about the function to be learned than just the training data. An interesting case is when certain directional derivatives of the desired function are known at certain points.

artificial intelligence, neural network, tangent vector, (16 more...)

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Koistinen, Petri, Holmström, Lasse

Kernel Regression and Backpropagation Training With Noise

One method proposed for improving the generalization capability of a feedforward network trained with the backpropagation algorithm is to use artificial training vectors which are obtained by adding noise to the original training vectors. We discuss the connection of such backpropagation training with noise to kernel density and kernel regression estimation. We compare by simulated examples (1) backpropagation, (2) backpropagation with noise, and (3) kernel regression in mapping estimation and pattern classification contexts.

artificial intelligence, neural network, noise, (15 more...)

Country: Europe > Finland (0.15)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Backpropagation (1.00)

Pouget, Alexandre, Fisher, Stephen A., Sejnowski, Terrence J.

Hierarchical Transformation of Space in the Visual System

Neurons encoding simple visual features in area VI such as orientation, direction of motion and color are organized in retinotopic maps. However, recent physiological experiments have shown that the responses of many neurons in VI and other cortical areas are modulated by the direction of gaze. We have developed a neural network model of the visual cortex to explore the hypothesis that visual features are encoded in headcentered coordinates at early stages of visual processing. New experiments are suggested for testing this hypothesis using electrical stimulations and psychophysical observations.

neural network, neurology, neuron, (20 more...)

Country: North America > United States (0.29)

Genre: Research Report > New Finding (0.87)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.37)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Ji, Chuanyi, Psaltis, Demetri

The VC-Dimension versus the Statistical Capacity of Multilayer Networks

The former characterizes their "Present Address: Department of Electrical Computer and System Engineering, Rensselaer Poly tech Institute, Troy, NY 12180.

artificial intelligence, neural network, vc-dimension, (17 more...)

Country: North America > United States > New York > Rensselaer County > Troy (0.24)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.50)

Scott, Gary M., Shavlik, Jude W., Ray, W. Harmon

Refining PID Controllers using Neural Networks

We apply this method to the task of controlling the outflow and temperature of a water tank, producing statistically-significant gains in accuracy over both a standard neural network approach and a non-learning PID controller. Furthermore, using the PID knowledge to initialize the weights of the network produces statistically less variation in testset accuracy when compared to networks initialized with small random numbers.

artificial intelligence, controller, neural network, (15 more...)

Country:

North America > United States > Wisconsin > Dane County > Madison (0.15)
North America > United States > California > San Mateo County (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Viola, Paul A., Lisberger, Stephen G., Sejnowski, Terrence J.

Recurrent Eye Tracking Network Using a Distributed Representation of Image Motion

This paper briefly describes an artificial neural network for preattentive visual processing. The network is capable of determiuing image motioll in a type of stimulus which defeats most popular methods of motion detect.ion

neural network, neurology, recurrent eye tracking network, (16 more...)

Country:

Europe (0.93)
North America > United States > California > San Francisco County > San Francisco (0.14)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.47)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Extracting and Learning an Unknown Grammar with Recurrent Neural Networks

Giles, C. L., Miller, C. B., Chen, D., Sun, G. Z., Chen, H. H., Lee, Y. C.

We show that similar methods are appropriate for learning unknown grammars from examples of their strings. TIle training algorithm is an incremental real-time, recurrent learning (RTRL) method that computes the complete gradient and updates the weights at the end of each string.

deep learning, grammar, neural network, (15 more...)

Country: North America > United States > California > San Mateo County (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.43)

Darrell, Trevor, Pentland, Alex

Against Edges: Function Approximation with Multiple Support Maps

Networks for reconstructing a sparse or noisy function often use an edge field to segment the function into homogeneous regions, This approach assumes that these regions do not overlap or have disjoint parts, which is often false. For example, images which contain regions split by an occluding object can't be properly reconstructed using this type of network. We have developed a network that overcomes these limitations, using support maps to represent the segmentation of a signal. In our approach, the support of each region in the signal is explicitly represented. Results from an initial implementation demonstrate that this method can reconstruct images and motion sequences which contain complicated occlusion.

approximation, artificial intelligence, fuzzy logic, (18 more...)

Country:

North America > United States > Massachusetts (0.14)
Asia > Japan (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.42)

Zador, Anthony M., Claiborne, Brenda J., Brown, Thomas H.

Nonlinear Pattern Separation in Single Hippocampal Neurons with Active Dendritic Membrane

The cold spot consisted of a high density of a Ca-activated K channel.

cold spot, health & medicine, neural network, (18 more...)

Country: North America > United States > Texas (0.14)

Industry: Health & Medicine (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.32)

Moody, John, Utans, Joachim

Principled Architecture Selection for Neural Networks: Application to Corporate Bond Rating Prediction

The notion of generalization ability can be defined precisely as the prediction risk, the expected performance of an estimator in predicting new observations. In this paper, we propose the prediction risk as a measure of the generalization ability of multi-layer perceptron networks and use it to select an optimal network architecture from a set of possible architectures. We also propose a heuristic search strategy to explore the space of possible architectures. The prediction risk is estimated from the available data; here we estimate the prediction risk by v-fold cross-validation and by asymptotic approximations of generalized cross-validation or Akaike's final prediction error. We apply the technique to the problem of predicting corporate bond ratings. This problem is very attractive as a case study, since it is characterized by the limited availability of the data and by the lack of a complete a priori model which could be used to impose a structure to the network architecture.

banking & finance, input variable, neural network, (13 more...)