A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction

Jan-19-2022, 11:20:13 GMT–#artificialintelligence

This paper presents a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that retains favorable features of both model-based and model-free methodologies. The algorithm, referred to as Dyna-Delayed Q-learning (DDQ), combines the model-free Delayed Q-learning and model-based R-max algorithms while outperforming both in most cases. The paper includes a PAC analysis of DDQ and a derivation of its sample complexity. Numerical results support the claim regarding the new algorithm's sample efficiency relative to its parents as well as to the best-known PAC model-free and model-based algorithms in application. A real-world experimental implementation of DDQ in the context of pediatric motor rehabilitation facilitated by infant-robot interaction highlights the potential benefits of the reported method.
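For readers unfamiliar with the model-free parent named above, the following is a minimal Python sketch of the delayed update rule usually associated with Delayed Q-learning. It is not the DDQ algorithm from the paper and omits parts of the original method (for example, the per-pair LEARN flags). The class name `DelayedQSketch` and the parameters `m`, `eps1`, and `r_max` are assumptions chosen for illustration.

```python
from collections import defaultdict


class DelayedQSketch:
    """Simplified Delayed Q-learning update (illustrative, not DDQ itself)."""

    def __init__(self, actions, gamma=0.9, m=5, eps1=0.01, r_max=1.0):
        self.actions = list(actions)
        self.gamma = gamma
        self.m = m          # samples gathered before an update is attempted (assumed name)
        self.eps1 = eps1    # update tolerance (assumed name)
        # Optimistic initialization: every Q-value starts at r_max / (1 - gamma).
        optimistic = r_max / (1.0 - gamma)
        self.q = defaultdict(lambda: optimistic)
        self.target_sum = defaultdict(float)
        self.count = defaultdict(int)

    def value(self, state):
        """Greedy state value: max over actions of Q(state, action)."""
        return max(self.q[(state, a)] for a in self.actions)

    def greedy_action(self, state):
        """Action with the largest current Q-value in the given state."""
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def observe(self, state, action, reward, next_state):
        """Accumulate one sample; attempt an update after m samples.

        Returns True if Q(state, action) was changed by this call.
        """
        key = (state, action)
        self.target_sum[key] += reward + self.gamma * self.value(next_state)
        self.count[key] += 1
        if self.count[key] < self.m:
            return False
        average = self.target_sum[key] / self.m
        # Reset the batch whether or not the update succeeds.
        self.target_sum[key] = 0.0
        self.count[key] = 0
        # Update only if it lowers Q(state, action) by at least eps1.
        if self.q[key] - average >= 2 * self.eps1:
            self.q[key] = average + self.eps1
            return True
        return False
```

The point of the sketch is the contrast with ordinary Q-learning: an estimate changes only after a batch of `m` samples, and only when the change is large enough. A model-based method such as R-max would instead estimate transition and reward models from visit counts and plan with them.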

algorithm, human-robot interaction, hybrid pac reinforcement learning algorithm

News Web Page

Industry:
- Health & Medicine > Therapeutic Area > Pediatrics/Neonatology (0.31)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)