Learning to Control an Octopus Arm with Gaussian Process Temporal Difference Methods

Apr-6-2023, 15:22:16 GMT–Neural Information Processing Systems

The Octopus arm is a highly versatile and complex limb. How the Octopus controls such a hyper-redundant arm (not to mention eight of them!) is as yet unknown. Robotic arms based on the same mechanical principles may render present-day robotic arms obsolete. In this paper, we tackle this control problem using an online reinforcement learning algorithm, based on a Bayesian approach to policy evaluation known as Gaussian process temporal difference (GPTD) learning. Our substitute for the real arm is a computer simulation of a 2-dimensional model of an Octopus arm.

artificial intelligence, machine learning, reinforcement learning, (2 more...)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)