Reinforcement Learning-based Switching Controller for a Milliscale Robot in a Constrained Environment
Tariverdi, Abbas, Côté-Allard, Ulysse, Mathiassen, Kim, Elle, Ole J., Kalvøy, Håvard, Martinsen, Ørjan G., Tørresen, Jim
arXiv.org Artificial Intelligence
This work presents a reinforcement learning-based switching control mechanism that autonomously moves a ferromagnetic object (representing a milliscale robot) around obstacles within a constrained environment in the presence of disturbances. This mechanism can be used to navigate objects (e.g., endoscopy capsules, swarms of drug particles) through complex environments where active control is necessary but direct manipulation can be hazardous. The proposed control scheme is a switching control architecture implemented by two sub-controllers. The first sub-controller uses the robot's inverse kinematic solutions to search the environment for the ferromagnetic particle to be carried, while remaining robust to disturbances. The second sub-controller uses a customized Rainbow algorithm to control a robotic arm, the UR5 robot, so that it carries the ferromagnetic particle to a desired position through a constrained environment. The customized Rainbow algorithm employs the Quantile Huber loss from the Implicit Quantile Networks (IQN) algorithm and a ResNet. The proposed controller is first trained and tested in a real-time physics simulation engine (PyBullet). Afterward, the trained controller is transferred to a UR5 robot to remotely transport a ferromagnetic particle in a real-world scenario, achieving a 98.86% success rate over 30 episodes for randomly generated trajectories, which demonstrates the viability of the proposed approach for real-life applications. In addition, two classical pathfinding approaches, Attractor Dynamics and the execution-extended Rapidly-Exploring Random Trees (ERRT), are investigated and compared to the RL-based method. The proposed RL-based algorithm is shown to achieve performance comparable to that of the tested classical path planners while being more robust to deploy in dynamic environments.
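The abstract names the Quantile Huber loss from IQN without stating it. As a rough illustration only, the sketch below computes that loss in NumPy as it is commonly defined in the quantile-regression RL literature; it is not the authors' implementation. The function name `quantile_huber_loss`, the argument layout, and the default threshold `kappa=1.0` are assumptions made for this example.

```python
import numpy as np

def quantile_huber_loss(pred, target, taus, kappa=1.0):
    """Quantile Huber loss between predicted and target return samples.

    pred   : shape (N,), predicted quantile values, one per fraction in taus
    target : shape (M,), target return samples
    taus   : shape (N,), quantile fractions in (0, 1) matching pred
    kappa  : Huber threshold (assumed default, must be > 0)
    """
    pred = np.asarray(pred, dtype=float)
    target = np.asarray(target, dtype=float)
    taus = np.asarray(taus, dtype=float)

    # Pairwise errors: u[i, j] = target[j] - pred[i]
    u = target[None, :] - pred[:, None]
    abs_u = np.abs(u)

    # Huber loss: quadratic near zero, linear beyond kappa
    huber = np.where(abs_u <= kappa,
                     0.5 * u ** 2,
                     kappa * (abs_u - 0.5 * kappa))

    # Asymmetric quantile weight |tau - 1{u < 0}|
    weight = np.abs(taus[:, None] - (u < 0).astype(float))

    # Mean over target samples, sum over predicted quantiles
    return float((weight * huber / kappa).mean(axis=1).sum())
```

The asymmetric weight is what turns the ordinary Huber loss into a quantile regression loss: for a low quantile fraction, overestimating the target is penalized more heavily than underestimating it by the same amount.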
Apr-29-2023
- Country:
- Asia
- India (0.04)
- Middle East > Iran
- Tehran Province > Tehran (0.04)
- Europe > Norway
- Central Norway > Trøndelag
- Trondheim (0.04)
- Eastern Norway > Oslo (0.06)
- North America > Canada
- Quebec (0.04)
- Genre:
- Research Report (1.00)
- Industry:
- Health & Medicine
- Diagnostic Medicine > Imaging (0.34)
- Therapeutic Area (1.00)
- Technology: