Regression via Kirszbraun Extension with Applications to Imitation Learning

Biess, Armin, Kontorovich, Aryeh, Makarychev, Yury, Zaichyk, Hanan

May-28-2019–arXiv.org Machine Learning

Learning by demonstration is a versatile and rapid mechanism for transferring motor skills from a teacher to a learner. A particular challenge in imitation learning is the so-called correspondence problem, which involves mapping actions between a teacher and a learner having substantially different embodiments (say, human to robot). We present a general, model free and non-parametric imitation learning algorithm based on regression between two Hilbert spaces. We accomplish this via Kirszbraun's extension theorem --- apparently the first application of this technique to supervised learning --- and analyze its statistical and computational aspects. We begin by formulating the correspondence problem in terms of quadratically constrained quadratic program (QCQP) regression. Then we describe a procedure for smoothing the training data, which amounts to regularizing hypothesis complexity via its Lipschitz constant. The Lipschitz constant is tuned via a Structural Risk Minimization (SRM) procedure, based on the covering-number risk bounds we derive. We apply our technique to a static posture imitation task between two robotic manipulators with different embodiments, and report promising results.

artificial intelligence, inductive learning, machine learning, (16 more...)

arXiv.org Machine Learning

May-28-2019

arXiv.org PDF

Add feedback

Country:
- Europe > United Kingdom
  - England
    - Cambridgeshire > Cambridge (0.04)
    - Oxfordshire > Oxford (0.04)
- North America
  - Canada > Quebec
    - Montreal (0.04)
  - United States
    - California > Los Angeles County
      - Los Angeles (0.14)
    - Florida > Broward County
      - Fort Lauderdale (0.04)
    - Illinois > Cook County
      - Chicago (0.04)

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Inductive Learning (0.88)
    - Statistical Learning (1.00)
  - Robots (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found