AITopics

Reinforcement learning addresses the problem of learning to select actions in order to maximize one's performance in unknown environments. To scale reinforcement learning to complex real-world tasks, such as typically studied in AI, one must ultimately be able to discover the structure in the world, in order to abstract away the myriad of details and to operate in more tractable problem spaces. This paper presents the SKILLS algorithm. SKILLS discovers skills, which are partially defined action policies that arise in the context of multiple, related tasks.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Country:

North America > United States > California > Santa Clara County (0.14)
Europe > United Kingdom > England (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Harmon, Mance E., III, Leemon C. Baird, Klopf, A. Harry

Advantage Updating Applied to a Differential Game

An application of reinforcement learning to a linear-quadratic, differential game is presented. The reinforcement learning system uses a recently developed algorithm, the residual gradient form of advantage updating. The game is a Markov Decision Process (MDP) with continuous time, states, and actions, linear dynamics, and a quadratic cost function. The game consists of two players, a missile and a plane; the missile pursues the plane and the plane evades the missile. The reinforcement learning algorithm for optimal control is modified for differential games in order to find the minimax point, rather than the maximum. Simulation results are compared to the optimal solution, demonstrating that the simulated reinforcement learning system converges to the optimal answer. The performance of both the residual gradient and non-residual gradient forms of advantage updating and Q-learning are compared. The results show that advantage updating converges faster than Q-learning in all simulations.

algorithm, artificial intelligence, reinforcement learning, (16 more...)

Country:

North America > United States > Massachusetts (0.14)
Europe > United Kingdom > England (0.14)

Industry: Government > Military > Air Force (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Tzonev, Svilen, Schulten, Klaus, Malpeli, Joseph G.

Morphogenesis of the Lateral Geniculate Nucleus: How Singularities Affect Global Structure

The macaque lateral geniculate nucleus (LGN) exhibits an intricate lamination pattern, which changes midway through the nucleus at a point coincident with small gaps due to the blind spot in the retina. We present a three-dimensional model of morphogenesis in which local cell interactions cause a wave of development of neuronal receptive fieldsto propagate through the nucleus and establish two distinct lamination patterns. We examine the interactions between the wave and the localized singularities due to the gaps, and find that the gaps induce the change in lamination pattern. We explore critical factors which determine general LGN organization.

artificial intelligence, health & medicine, transition, (16 more...)

Country:

North America > United States > Illinois > Champaign County > Urbana (0.15)
North America > United States > Illinois > Champaign County > Champaign (0.14)

Industry: Health & Medicine > Therapeutic Area (0.31)

Technology:

Information Technology > Artificial Intelligence > The Future (0.61)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.61)

Krogh, Anders, Vedelsby, Jesper

Neural Network Ensembles, Cross Validation, and Active Learning

It is well known that a combination of many different predictors can improve predictions. Inthe neural networks community "ensembles" of neural networks has been investigated by several authors, see for instance [1, 2, 3]. Most often the networks in the ensemble are trained individually and then their predictions are combined. This combination is usually done by majority (in classification) or by simple averaging (inregression), but one can also use a weighted combination of the networks.

artificial intelligence, generalization error, neural network, (16 more...)

Country:

Europe > Denmark (0.29)
North America > United States > California > San Mateo County > San Mateo (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Cross Validation (0.45)

On the Computational Complexity of Networks of Spiking Neurons

Maass, Wolfgang

We investigate the computational power of a formal model for networks ofspiking neurons, both for the assumption of an unlimited timing precision, and for the case of a limited timing precision. We also prove upper and lower bounds for the number of examples that are needed to train such networks.

artificial intelligence, neural network, neuron, (16 more...)

Country: Europe > Austria (0.15)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.99)

Shultz, Thomas R., Oshima-Takane, Yuriko, Takane, Yoshio

Analysis of Unstandardized Contributions in Cross Connected Networks

Understanding knowledge representations in neural nets has been a difficult problem. Principal components analysis (PCA) of contributions (products of sending activations and connection weights) has yielded valuable insights into knowledge representations, but much of this work has focused on the correlation matrix of contributions. The present work shows that analyzing the variance-covariance matrix of contributions yields more valid insights by taking account of weights.

artificial intelligence, contribution, neural network, (15 more...)

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > Canada > Quebec > Montreal (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.88)

Williams, Christopher K. I., Revow, Michael, Hinton, Geoffrey E.

Using a neural net to instantiate a deformable model

Deformable models are an attractive approach to recognizing nonrigid objects which have considerable within class variability. However, there are severe search problems associated with fitting the to data. We show that by using neural networks to providemodels better starting points, the search time can be significantly reduced. The method is demonstrated on a character recognition task.

artificial intelligence, instantiation parameter, neural network, (16 more...)

Country:

North America > United States (0.46)
North America > Canada > Ontario > Toronto (0.15)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Spector, Kalanit Grill, Edelman, Shimon, Malach, Rafael

Anatomical origin and computational role of diversity in the response properties of cortical neurons

A fundamental feature of cortical architecture is its columnar organization, manifested in the tendency of neurons with similar properties to be organized in columns that run perpendicular to the cortical surface. This organization of the cortex was initially discovered by physiological experiments (Mouncastle, 1957; Hubel and Wiesel, 1962), and subsequently confirmed with the demonstration of histologically defined that axonal projections throughout thecolumns. Tracing experiments have shown tend to be organized in vertically aligned clusters or patches.

diversity, health & medicine, neurology, (19 more...)

Country: North America > United States (0.28)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.48)

Technology: Information Technology > Artificial Intelligence (1.00)

The AI's Half-Century

Boden, Margaret A.

AI MagazineDec-15-1995

"How We Know Universals: The Perception Their first paper made many intellectual waves--which are still spreading, 50 years later. They had claimed that the truth or falsity of any (computable) proposition could, in with AI, for it's difficult to say just principle, be computed by a simple type of The future of psychology, they good a date as any, however, is 1943--almost said, consisted of the design of various sorts exactly half a century ago. This In that year, Warren McCulloch (a psychiatrist, novel methodology, and the nascent technology cybernetician, philosopher, and poet) associated with it, promised to show just and Walter Pitts (a research student in mathematics) how mind is grounded in mechanism. Much of this was "logical" in nature result was a heady brew, which explicitly and developed into what's known as classical, promised to revolutionize psychology and or symbolic, AI. But some was what is nowadays philosophy--and which, in the event, revolutionized called connectionist, studying networks technology too. In the late 1980s, however, it McCulloch and Pitts' paper ("A Logical Calculus blossomed--hitting the newsstands with of the Ideas Immanent in Nervous rash promises of "brainlike" computers just Activity") concentrated on how propositions around the corner. But both these forms of AI expressible in logic could be computed by share the same historical roots. Those nets consisted of So much for pedigree. But does a mere halfcentury cells passing inhibitory and excitatory messages of work count as a pedigree? Might it between them and acting as what computer rather be a mere blip, an unfortunate academic scientists (soon afterwards) called "and-mutation with no real intellectual fitness?

neural network, neurology, representation, (20 more...)

AI Magazine

Country:

Europe > United Kingdom > England (0.15)
North America > United States (0.14)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.72)