AITopics

Accumulating data from neurophysiology and neuropsychology have suggested two information processing roles for prefrontal cortex (PFC):1) short-term active memory; and 2) inhibition. We present a new behavioral task and a computational model which were developed in parallel. The task was developed to probe both of these prefrontal functions simultaneously, and produces a rich set of behavioral data that act as constraints on the model. The model is implemented in continuous-time, thus providing a natural framework in which to study the temporal dynamics of processing in the task. We show how the model can be used to examine the behavioral consequencesof neuromodulation in PFC. Specifically, we use the model to make novel and testable predictions regarding the behavioral performance of schizophrenics, who are hypothesized to suffer from reduced dopaminergic tone in this brain area.

neural network, neurology, pfc, (17 more...)

Country: North America > United States (0.47)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

Instance-Based State Identification for Reinforcement Learning

McCallum, R. Andrew

This paper presents instance-based state identification, an approach to reinforcement learning and hidden state that builds disambiguating amountsof short-term memory online, and also learns with an order of magnitude fewer training steps than several previous approaches. Inspiredby a key similarity between learning with hidden state and learning in continuous geometrical spaces, this approach uses instance-based (or "memory-based") learning, a method that has worked well in continuous spaces. 1 BACKGROUND AND RELATED WORK When a robot's next course of action depends on information that is hidden from the sensors because of problems such as occlusion, restricted range, bounded field of view and limited attention, the robot suffers from hidden state. More formally, we say a reinforcement learning agent suffers from the hidden state problem if the agent's state representation is non-Markovian with respect to actions and utility. The hidden state problem arises as a case of perceptual aliasing: the mapping between statesof the world and sensations of the agent is not one-to-one [Whitehead, 1992]. If the agent's perceptual system produces the same outputs for two world states in which different actions are required, and if the agent's state representation consists only of its percepts, then the agent will fail to choose correct actions.

agent, artificial intelligence, reinforcement learning, (12 more...)

Country: Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Tresp, Volker, Neuneier, Ralph, Ahmad, Subutai

Efficient Methods for Dealing with Missing Data in Supervised Learning

Palo Alto, CA 94304 Abstract We present efficient algorithms for dealing with the problem of missing inputs(incomplete feature vectors) during training and recall. Our approach is based on the approximation of the input data distribution usingParzen windows. For recall, we obtain closed form solutions for arbitrary feedforward networks. For training, we show how the backpropagation step for an incomplete pattern can be approximated by a weighted averaged backpropagation step. The complexity of the solutions for training and recall is independent of the number of missing features.

artificial intelligence, neural network, tresp, (17 more...)

Country: North America > United States > California > Santa Clara County > Palo Alto (0.24)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Wolpert, Daniel M., Ghahramani, Zoubin, Jordan, Michael I.

Forward dynamic models in human motor control: Psychophysical evidence

An impedence controlled manipulandum for human movement studies.

artificial intelligence, forward model, neural network, (16 more...)

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.15)

Genre: Research Report (0.94)

Technology: Information Technology > Artificial Intelligence > Cognitive Science (1.00)

A Rigorous Analysis of Linsker-type Hebbian Learning

Feng, J., Pan, H., Roychowdhury, V. P.

His simulations have shown that for appropriate parameter regimes, several structured connection patterns (e.g., centre-surround and oriented afferent receptive fields (aRFs)) occur progressively as the Hebbian evolution of the weights is carried out layer by layer. The behavior of Linsker's model is determined by the underlying nonlinear dynamics which are parameterized by a set of parameters originating from the Hebbian rule and the arbor density of the synapses.

artificial intelligence, attractor, machine learning, (18 more...)

Country: North America > United States > Indiana > Tippecanoe County (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Thrun, Sebastian, Schwartz, Anton

Finding Structure in Reinforcement Learning

Reinforcement learning addresses the problem of learning to select actions in order to maximize one's performance in unknown environments. To scale reinforcement learning to complex real-world tasks, such as typically studied in AI, one must ultimately be able to discover the structure in the world, in order to abstract away the myriad of details and to operate in more tractable problem spaces. This paper presents the SKILLS algorithm. SKILLS discovers skills, which are partially defined action policies that arise in the context of multiple, related tasks.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Country:

North America > United States > California > Santa Clara County (0.14)
Europe > United Kingdom > England (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Harmon, Mance E., III, Leemon C. Baird, Klopf, A. Harry

Advantage Updating Applied to a Differential Game

An application of reinforcement learning to a linear-quadratic, differential game is presented. The reinforcement learning system uses a recently developed algorithm, the residual gradient form of advantage updating. The game is a Markov Decision Process (MDP) with continuous time, states, and actions, linear dynamics, and a quadratic cost function. The game consists of two players, a missile and a plane; the missile pursues the plane and the plane evades the missile. The reinforcement learning algorithm for optimal control is modified for differential games in order to find the minimax point, rather than the maximum. Simulation results are compared to the optimal solution, demonstrating that the simulated reinforcement learning system converges to the optimal answer. The performance of both the residual gradient and non-residual gradient forms of advantage updating and Q-learning are compared. The results show that advantage updating converges faster than Q-learning in all simulations.

algorithm, artificial intelligence, reinforcement learning, (16 more...)

Country:

North America > United States > Massachusetts (0.14)
Europe > United Kingdom > England (0.14)

Industry: Government > Military > Air Force (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Tzonev, Svilen, Schulten, Klaus, Malpeli, Joseph G.

Morphogenesis of the Lateral Geniculate Nucleus: How Singularities Affect Global Structure

The macaque lateral geniculate nucleus (LGN) exhibits an intricate lamination pattern, which changes midway through the nucleus at a point coincident with small gaps due to the blind spot in the retina. We present a three-dimensional model of morphogenesis in which local cell interactions cause a wave of development of neuronal receptive fieldsto propagate through the nucleus and establish two distinct lamination patterns. We examine the interactions between the wave and the localized singularities due to the gaps, and find that the gaps induce the change in lamination pattern. We explore critical factors which determine general LGN organization.

artificial intelligence, health & medicine, transition, (16 more...)

Country:

North America > United States > Illinois > Champaign County > Urbana (0.15)
North America > United States > Illinois > Champaign County > Champaign (0.14)

Industry: Health & Medicine > Therapeutic Area (0.31)

Technology:

Information Technology > Artificial Intelligence > The Future (0.61)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.61)

Krogh, Anders, Vedelsby, Jesper

Neural Network Ensembles, Cross Validation, and Active Learning

It is well known that a combination of many different predictors can improve predictions. Inthe neural networks community "ensembles" of neural networks has been investigated by several authors, see for instance [1, 2, 3]. Most often the networks in the ensemble are trained individually and then their predictions are combined. This combination is usually done by majority (in classification) or by simple averaging (inregression), but one can also use a weighted combination of the networks.

artificial intelligence, generalization error, neural network, (16 more...)

Country:

Europe > Denmark (0.29)
North America > United States > California > San Mateo County > San Mateo (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Cross Validation (0.45)

On the Computational Complexity of Networks of Spiking Neurons

Maass, Wolfgang

We investigate the computational power of a formal model for networks ofspiking neurons, both for the assumption of an unlimited timing precision, and for the case of a limited timing precision. We also prove upper and lower bounds for the number of examples that are needed to train such networks.

artificial intelligence, neural network, neuron, (16 more...)

Country: Europe > Austria (0.15)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.99)