Learning Task Specifications from Demonstrations
Marcell Vazquez-Chanlatte, Susmit Jha, Ashish Tiwari, Mark K. Ho, Sanjit Seshia
–Neural Information Processing Systems
In many settings (e.g., robotics) demonstrations provide a natural way to specify a task. For example, an agent (e.g., human expert) gives one or more demonstrations of the task from which we seek to automatically synthesize a policy for the robot to execute. Typically, one models the demonstrator as episodically operating within a dynamical system whose transition relation only depends on the current state and action (called the Markov condition). However, even if the dynamics are Markovian, many problems are naturally modeled in non-Markovian terms (see Ex 1).
Neural Information Processing Systems
Nov-20-2025, 17:29:29 GMT
- Country:
- North America
- Canada > Quebec
- Montreal (0.04)
- United States
- California > Alameda County
- Berkeley (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Texas > Travis County
- Austin (0.04)
- California > Alameda County
- Canada > Quebec
- North America
- Technology: