Value Iteration for Learning Concurrently Executable Robotic Control Tasks
Tahmid, Sheikh A., Notomista, Gennaro
–arXiv.org Artificial Intelligence
Many modern robotic systems such as multi-robot systems and manipulators exhibit redundancy, a property owing to which they are capable of executing multiple tasks. This work proposes a novel method, based on the Reinforcement Learning (RL) paradigm, to train redundant robots to be able to execute multiple tasks concurrently. Our approach differs from typical multi-objective RL methods insofar as the learned tasks can be combined and executed in possibly time-varying prioritized stacks. We do so by first defining a notion of task independence between learned value functions. We then use our definition of task independence to propose a cost functional that encourages a policy, based on an approximated value function, to accomplish its control objective while minimally interfering with the execution of higher priority tasks. This allows us to train a set of control policies that can be executed simultaneously. We also introduce a version of fitted value iteration to learn to approximate our proposed cost functional efficiently. We demonstrate our approach on several scenarios and robotic systems.
arXiv.org Artificial Intelligence
Apr-1-2025
- Country:
- Europe > Switzerland (0.04)
- Africa > Togo (0.04)
- North America
- United States > Michigan
- Wayne County > Detroit (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada > Ontario
- Waterloo Region > Waterloo (0.04)
- United States > Michigan
- Asia > Middle East
- Jordan (0.04)
- Genre:
- Research Report (0.84)
- Technology: