Interactive Learning from Activity Description
Nguyen, Khanh, Misra, Dipendra, Schapire, Robert, Dudík, Miro, Shafto, Patrick
–arXiv.org Artificial Intelligence
We present a novel interactive learning protocol that enables training request-fulfilling agents by verbally describing their activities. Our protocol gives rise to a new family of interactive learning algorithms that offer complementary advantages against traditional algorithms like imitation learning (IL) and reinforcement learning (RL). We develop an algorithm that practically implements this protocol and employ it to train agents in two challenging request-fulfilling problems using purely language-description feedback. Empirical results demonstrate the strengths of our algorithm: compared to RL baselines, it is more sample-efficient; compared to IL baselines, it achieves competitive success rates while not requiring feedback providers to have agent-specific expertise. We also provide theoretical guarantees of the algorithm under certain assumptions on the teacher and the environment.
arXiv.org Artificial Intelligence
Feb-13-2021
- Country:
- Oceania > Australia
- North America
- United States
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- New York City (0.04)
- New Jersey > Essex County
- Newark (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Maryland > Prince George's County
- College Park (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > San Diego County
- San Diego (0.04)
- Texas > Travis County
- Canada
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- Germany > Berlin (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Italy > Tuscany
- Florence (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia > Middle East
- Jordan (0.04)
- Genre:
- Research Report > New Finding (0.87)
- Industry:
- Education > Educational Setting > Online (1.00)
- Technology: