Online Learning in Repeated Human-Robot Interactions

Babushkin, Vahan (Masdar Institute of Science and Technology) | Oudah, Mayada (Masdar Institute of Science and Technology) | Chenlinangjia, Tennom (Masdar Institute of Science and Technology) | Alshaer, Ahmed (American University of Sharjah) | Crandall, Jacob W. (Masdar Institute of Science and Technology)

AAAI Conferences 

Adaptation is a critical component of collaboration. Nevertheless, online learning is not yet used in most successful human-robot interactions, especially when the human's and robot's goals are not fully aligned. There are at least two barriers to the successful application of online learning in HRI. First, typical machine-learning algorithms do not learn at time scales that support effective interactions with people. Algorithms that learn at sufficiently fast time scales often produce myopic strategies that do not lead to good long-term collaborations. Second, random exploration, a core component of most online-learning algorithms, can be problematic for developing collaborative relationships with a human partner. We anticipate that a new genre of online-learning algorithms can overcome these two barriers when paired with (cheap-talk) communication. In this paper, we overview our efforts in these two areas to produce a situation-independent, learning system that quickly learns to collaborate with a human partner.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found