Obstacle Avoidance through Reinforcement Learning

Apr-6-2023, 19:23:12 GMT–Neural Information Processing Systems

A method is described for generating plan-like. The experiments reported here use a simulated vehicle with a primitive range sensor. Avoidance behaviour is encoded as a set of continuous functions of the perceptual input space. These functions are stored using CMACs and trained by a variant of Barto and Sutton's adaptive critic algorithm. As the vehicle explores its surroundings it adapts its responses to sensory stimuli so as to minimise the negative reinforcement arising from collisions.

collision, obstacle avoidance, reinforcement learning

Neural Information Processing Systems

Apr-6-2023, 19:23:12 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Planning & Scheduling (0.48)
  - Machine Learning > Reinforcement Learning (0.40)