Iterative Teacher-Aware Learning

Luyao Yuan, Dongruo Zhou, Junhong Shen, Jingdong Gao, Jeffrey L. Chen, Quanquan Gu, Ying Nian Wu, Song-Chun Zhu

Oct-26-2021, arXiv.org Artificial Intelligence

In human pedagogy, teachers and students can interact adaptively to maximize communication efficiency. The teacher adjusts her teaching method for different students, and the student, after becoming familiar with the teacher's instruction mechanism, can infer the teacher's intention and learn faster. Recently, multiple works have demonstrated the benefits of integrating this cooperative pedagogy into machine concept learning in discrete spaces. However, how cooperative pedagogy can facilitate machine parameter learning has not been thoroughly studied. In this paper, we propose a gradient-optimization-based teacher-aware learner that incorporates the teacher's cooperative intention into the likelihood function and learns provably faster than the naive learning algorithms used in previous machine teaching works. We prove that the iterative teacher-aware learning (ITAL) process leads to local and global improvements. We then validate our algorithms with extensive experiments on various tasks, including regression, classification, and inverse reinforcement learning, using synthetic and real data. We also show the advantage of modeling teacher-awareness when agents learn from human teachers.

machine learning, reinforcement learning, teaching method

Country:
- North America
  - Canada > Ontario
    - Toronto (0.14)
  - United States
    - California (0.14)
    - Wisconsin (0.14)

Genre:
- Research Report
  - Experimental Study (0.93)
  - New Finding (1.00)

Industry:
- Education > Educational Setting (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Learning Graphical Models
      - Directed Networks > Bayesian Learning (0.46)
      - Undirected Networks > Markov Models (0.46)
    - Neural Networks (0.93)
    - Reinforcement Learning (1.00)
    - Statistical Learning (1.00)
  - Representation & Reasoning
    - Agents (0.93)
    - Uncertainty > Bayesian Inference (0.67)