Iterative Teacher-Aware Learning
Yuan, Luyao; Zhou, Dongruo; Shen, Junhong; Gao, Jingdong; Chen, Jeffrey L.; Gu, Quanquan; Wu, Ying Nian; Zhu, Song-Chun
arXiv.org Artificial Intelligence
In human pedagogy, teachers and students interact adaptively to maximize communication efficiency. The teacher adjusts her teaching method for different students, and the student, once familiar with the teacher's instruction mechanism, can infer the teacher's intention and thereby learn faster. Recently, multiple works have demonstrated the benefits of integrating this cooperative pedagogy into machine concept learning in discrete spaces. However, how cooperative pedagogy can facilitate machine parameter learning has not been thoroughly studied. In this paper, we propose a gradient-optimization-based teacher-aware learner that incorporates the teacher's cooperative intention into the likelihood function and learns provably faster than the naive learning algorithms used in previous machine teaching works. We prove that the iterative teacher-aware learning (ITAL) process leads to local and global improvements. We then validate our algorithms with extensive experiments on various tasks, including regression, classification, and inverse reinforcement learning, using synthetic and real data. We also show the advantage of modeling teacher-awareness when agents learn from human teachers.
Oct-26-2021
- Country:
  - North America
    - Canada > Ontario
      - Toronto (0.14)
    - United States
      - California (0.14)
      - Wisconsin (0.14)
- Genre:
  - Research Report
    - Experimental Study (0.93)
    - New Finding (1.00)
- Industry:
  - Education > Educational Setting (0.67)
- Technology:
  - Information Technology > Artificial Intelligence
    - Machine Learning
      - Learning Graphical Models
        - Directed Networks > Bayesian Learning (0.46)
        - Undirected Networks > Markov Models (0.46)
      - Neural Networks (0.93)
      - Reinforcement Learning (1.00)
      - Statistical Learning (1.00)
    - Representation & Reasoning
      - Agents (0.93)
      - Uncertainty > Bayesian Inference (0.67)