Reinforcement Learning-based Switching Controller for a Milliscale Robot in a Constrained Environment

Tariverdi, Abbas, Côté-Allard, Ulysse, Mathiassen, Kim, Elle, Ole J., Kalvøy, Håvard, Martinsen, Ørjan G., Tørresen, Jim

Apr-29-2023–arXiv.org Artificial Intelligence

This work presents a reinforcement learning-based switching control mechanism that autonomously moves a ferromagnetic object (representing a milliscale robot) around obstacles within a constrained environment in the presence of disturbances. This mechanism can be used to navigate objects (e.g., capsule endoscopes, swarms of drug particles) through complex environments where active control is necessary but direct manipulation can be hazardous. The proposed control scheme is a switching control architecture implemented by two sub-controllers. The first sub-controller uses the robot's inverse kinematic solutions to search the environment for the to-be-carried ferromagnetic particle while remaining robust to disturbances. The second sub-controller uses a customized Rainbow algorithm to control a robotic arm, i.e., the UR5 robot, to carry the ferromagnetic particle to a desired position through a constrained environment. The customized Rainbow algorithm employs the quantile Huber loss from the Implicit Quantile Networks (IQN) algorithm together with a ResNet. The proposed controller is first trained and tested in a real-time physics simulation engine (PyBullet). Afterward, the trained controller is transferred to a UR5 robot to remotely transport a ferromagnetic particle in a real-world scenario, achieving a 98.86% success rate over 30 episodes for randomly generated trajectories, which demonstrates the viability of the proposed approach for real-life applications. In addition, two classical pathfinding approaches, Attractor Dynamics and the execution extended Rapidly-Exploring Random Trees (ERRT), are investigated and compared to the RL-based method. The proposed RL-based algorithm is shown to achieve performance comparable to that of the tested classical path planners whilst being more robust to deploy in dynamic environments.
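For readers unfamiliar with the quantile Huber loss mentioned in the abstract, the following is a minimal sketch of its standard formulation as used in quantile-based distributional RL such as IQN. It is not the authors' implementation: the abstract does not give the details of their customization, and the function names, the threshold `kappa = 1.0`, and the pure-Python style are assumptions made for illustration only.

```python
# Illustrative sketch of the standard quantile Huber loss (not the paper's code).
# All function names and the default kappa are assumptions for this example.

def huber(u, kappa=1.0):
    """Huber loss: quadratic for |u| <= kappa, linear beyond it."""
    a = abs(u)
    if a <= kappa:
        return 0.5 * u * u
    return kappa * (a - 0.5 * kappa)


def quantile_huber(u, tau, kappa=1.0):
    """Asymmetric Huber loss for a TD error u at quantile fraction tau.

    Errors with u < 0 (prediction above target) are weighted by (1 - tau),
    errors with u >= 0 are weighted by tau.
    """
    indicator = 1.0 if u < 0 else 0.0
    return abs(tau - indicator) * huber(u, kappa) / kappa


def quantile_huber_loss(pred_quantiles, taus, target_samples, kappa=1.0):
    """Sum over predicted quantiles of the mean loss against all target samples."""
    total = 0.0
    for theta, tau in zip(pred_quantiles, taus):
        per_target = [quantile_huber(t - theta, tau, kappa) for t in target_samples]
        total += sum(per_target) / len(per_target)
    return total
```

With `tau = 0.9`, an underestimate of the target by 2 is penalized nine times more heavily than an overestimate by 2, which is what pushes the estimate for that quantile fraction toward the upper part of the return distribution.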

machine learning, particle, reinforcement learning

Country:
- Asia
  - India (0.04)
  - Middle East > Iran
    - Tehran Province > Tehran (0.04)
- Europe > Norway
  - Central Norway > Trøndelag
    - Trondheim (0.04)
  - Eastern Norway > Oslo (0.06)
- North America > Canada
  - Quebec (0.04)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine
  - Diagnostic Medicine > Imaging (0.34)
  - Therapeutic Area (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (1.00)
  - Robots (1.00)
