AITopics

Determination of nearly optimalt or at least adequatet regions is left as an additional task that would require that the system dynamics be analyzedt which is not always possible. To address this problemt we move region boundaries adaptively t progressively altering the initial partitioning to a more appropriate representation with no need for a priori knowledge. Unlike previous work (Michiet 1968)t (Bartot 1983)t (Andersont 1982) which used fixed coderSt this approach produces adaptive coders that contract and expand regions/ranges. During adaptationt frequently active regions/ranges contractt reducing the number of situations in which they will be activated, and increasing the chances that neighboring regions will receive input instead. This class of self-organization is discussed in Kohonen (Kohonent 1984)t (Rittert 1986t 1988).

adaptive range, algorithm, boundary, (13 more...)

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.30)
North America > United States > Massachusetts > Hampshire County > Amherst (0.05)
North America > United States > Utah (0.05)
North America > United States > New York (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

A Reinforcement Learning Variant for Control Scheduling

Guha, Aloke

However, a large class of continuous control problems require maintaining the system at a desired operating point, or setpoint, at a given time. We refer to this problem as the basic setpoint control problem [Guha 90], and have shown that reinforcement learning can be used, not surprisingly, quite well for such control tasks.

controller, reinforcement, setpoint, (14 more...)

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > District of Columbia > Washington (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Integrated Modeling and Control Based on Reinforcement Learning and Dynamic Programming

Sutton, Richard S.

This is a summary of results with Dyna, a class of architectures for intelligent systems based on approximating dynamic programming methods. Dyna architectures integrate trial-and-error (reinforcement) learning and execution-time planning into a single process operating alternately on the world and on a learned forward model of the world. We describe and show results for two Dyna architectures, Dyna-AHC and Dyna-Q. Using a navigation task, results are shown for a simple Dyna-AHC system which simultaneously learns by trial and error, learns a world model, and plans optimal routes using the evolving world model. We show that Dyna-Q architectures (based on Watkins's Q-Iearning) are easy to adapt for use in changing environments.

architecture, evaluation function, world model, (14 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Waltham (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Iran (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Navigating through Temporal Difference

Dayan, Peter

Barto, Sutton and Watkins [2] introduced a grid task as a didactic example of temporal difference planning and asynchronous dynamical pre gramming. This paper considers the effects of changing the coding of the input stimulus, and demonstrates that the self-supervised learning of a particular form of hidden unit representation improves performance.

agent, prediction, representation, (16 more...)

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
(2 more...)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

A Connectionist Learning Control Architecture for Navigation

Bachrach, Jonathan R.

A novel learning control architecture is used for navigation. A sophisticated test-bed is used to simulate a cylindrical robot with a sonar belt in a planar environment. The task is short-range homing in the presence of obstacles. The robot receives no global information and assumes no comprehensive world model. Instead the robot receives only sensory information which is inherently limited. A connectionist architecture is presented which incorporates a large amount of a priori knowledge in the form of hard-wired networks, architectural constraints, and initial weights. Instead of hard-wiring static potential fields from object models, myarchitecture learns sensor-based potential fields, automatically adjusting them to avoid local minima and to produce efficient homing trajectories. It does this without object models using only sensory information. This research demonstrates the use of a large modular architecture on a difficult task.

architecture, connectionist learning control architecture, robot, (11 more...)

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
Asia > Middle East > Jordan (0.06)
North America > United States > California > Santa Clara County > Palo Alto (0.05)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.87)

Thrun, Sebastian, Möller, Knut, Linden, Alexander

Planning with an Adaptive World Model

We present a new connectionist planning method [TML90]. By interaction with an unknown environment, a world model is progressively constructed using gradient descent. For deriving optimal actions with respect to future reinforcement, planning is applied in two steps: an experience network proposes a plan which is subsequently optimized by gradient descent with a chain of world models, so that an optimal reinforcement may be obtained when it is actually run. The appropriateness of this method is demonstrated by a robotics application and a pole balancing task.

experience network, model network, reinforcement, (15 more...)

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
North America > United States > California > San Diego County > San Diego (0.05)
North America > United States > District of Columbia > Washington (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.71)

Frye, Robert C., Cummings, Kevin D., Rietman, Edward A.

Proximity Effect Corrections in Electron Beam Lithography Using a Neural Network

We have used a neural network to compute corrections for images written by electron beams to eliminate the proximity effects caused by electron scattering.

correction, neural network, pixel, (12 more...)

Country:

North America > United States > Arizona > Maricopa County > Tempe (0.04)
Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.66)

Katayama, Masazumi, Kawato, Mitsuo

Learning Trajectory and Force Control of an Artificial Muscle Arm by Parallel-hierarchical Neural Network Model

We propose a new parallel-hierarchical neural network model to enable motor learning for simultaneous control of both trajectory and force.

control law, inverse model, trajectory, (12 more...)

Country: Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.04)

Industry: Health & Medicine (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.63)

Rapidly Adapting Artificial Neural Networks for Autonomous Navigation

Pomerleau, Dean

Dean A. Pomerleau School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 Abstract The ALVINN (Autonomous Land Vehicle In a Neural Network) project addresses the problem of training artificial neural networks in real time to perform difficult perception tasks. ALVINN,is a back-propagation network that uses inputs from a video camera and an imaging laser rangefinder to drive the CMU Navlab, a modified Chevy van. This paper describes training techniques which allow ALVINN to learn in under 5 minutes to autonomously control the Navlab by watching a human driver's response to new situations. Using these techniques, ALVINN has been trained to drive in a variety of circumstances including single-lane paved and unpaved roads, multilane lined and unlined roads, and obstacle-ridden on-and off-road environments, at speeds of up to 20 miles per hour. 1 INTRODUCTION Previous trainable connectionist perception systems have often ignored important aspects of the form and content of available sensor data. Because of the assumed impracticality of training networks to perform realistic high level perception tasks, connectionist researchers have frequently restricted their task domains to either toy problems (e.g. the TC identification problem [11] [6]) or fixed low level operations (e.g.

alvinn, artificial neural network, neural network, (16 more...)

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.24)
North America > United States > Massachusetts > Suffolk County > Boston (0.05)
North America > United States > California > San Mateo County > San Mateo (0.05)
(3 more...)

Industry: Transportation > Ground > Road (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Tarassenko, Lionel, Brownlow, Michael, Marshall, Gillian, Tombs, Jan, Murray, Alan

Real-time autonomous robot navigation using VLSI neural networks

There have been very few demonstrations ofthe application ofVLSI neural networks to real world problems. Yet there are many signal processing, pattern recognition or optimization problems where a large number of competing hypotheses need to be explored in parallel, most often in real time. The massive parallelism of VLSI neural network devices, with one multiplier circuit per synapse, is ideally suited to such problems. In this paper, we present preliminary results from our design for a real time robot navigation system based on VLSI neural network modules.

navigation, neural network, obstacle, (12 more...)

Country:

North America > United States > California > San Mateo County > San Mateo (0.04)
North America > United States > California > Sacramento County > Sacramento (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Industry: Semiconductors & Electronics (0.95)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)