A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction