Online Learning in Repeated Human-Robot Interactions
Babushkin, Vahan (Masdar Institute of Science and Technology) | Oudah, Mayada (Masdar Institute of Science and Technology) | Chenlinangjia, Tennom (Masdar Institute of Science and Technology) | Alshaer, Ahmed (American University of Sharjah) | Crandall, Jacob W. (Masdar Institute of Science and Technology)
Adaptation is a critical component of collaboration. Nevertheless, online learning is not yet used in most successful human-robot interactions, especially when the human's and robot's goals are not fully aligned. There are at least two barriers to the successful application of online learning in HRI. First, typical machine-learning algorithms do not learn at time scales that support effective interactions with people. Algorithms that learn at sufficiently fast time scales often produce myopic strategies that do not lead to good long-term collaborations. Second, random exploration, a core component of most online-learning algorithms, can be problematic for developing collaborative relationships with a human partner. We anticipate that a new genre of online-learning algorithms can overcome these two barriers when paired with (cheap-talk) communication. In this paper, we overview our efforts in these two areas to produce a situation-independent, learning system that quickly learns to collaborate with a human partner.
Nov-1-2014
- Country:
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.15)
- Industry:
- Education > Educational Setting > Online (1.00)
- Technology: