Calinon, Sylvain
A survey on policy search algorithms for learning robot controllers in a handful of trials
Chatzilygeroudis, Konstantinos, Vassiliades, Vassilis, Stulp, Freek, Calinon, Sylvain, Mouret, Jean-Baptiste
Most policy search algorithms require thousands of training episodes to find an effective policy, which is often infeasible with a physical robot. This survey article focuses on the opposite extreme of the spectrum: how can a robot adapt with only a handful of trials (a dozen) and a few minutes? By analogy with the term "big data", we refer to this challenge as "micro-data reinforcement learning". We show that a first strategy is to leverage prior knowledge on the policy structure (e.g., dynamic movement primitives), on the policy parameters (e.g., demonstrations), or on the dynamics (e.g., simulators). A second strategy is to build data-driven surrogate models of the expected reward (e.g., Bayesian optimization) or of the dynamics (e.g., model-based policy search), so that the policy optimizer queries the model instead of the real system. Overall, successful micro-data algorithms combine these two strategies, varying the kind of model and prior knowledge. The current scientific challenges essentially revolve around scaling up to complex robots (e.g., humanoids), designing generic priors, and reducing computation time.
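The second strategy (a data-driven surrogate of the expected reward) can be sketched as a minimal Bayesian-optimization loop: a Gaussian-process surrogate of the reward is fit to the trials run so far, and an acquisition function picks the next policy to try, so most queries hit the model rather than the robot. The 1-D `reward` function, kernel settings and trial budget below are illustrative stand-ins, not values from the paper.

```python
import numpy as np

def rbf_kernel(A, B, length=0.3):
    """Squared-exponential kernel between two sets of policy parameters."""
    d = A[:, None, :] - B[None, :, :]
    return np.exp(-0.5 * np.sum(d**2, axis=2) / length**2)

def gp_posterior(X, y, Xq, noise=1e-4):
    """GP posterior mean/std of the reward surrogate at query points Xq."""
    K = rbf_kernel(X, X) + noise * np.eye(len(X))
    Ks = rbf_kernel(Xq, X)
    mu = Ks @ np.linalg.solve(K, y)
    v = np.linalg.solve(K, Ks.T)
    var = np.diag(rbf_kernel(Xq, Xq) - Ks @ v)
    return mu, np.sqrt(np.clip(var, 1e-12, None))

def reward(theta):
    """Toy stand-in for one episode on the robot."""
    return float(-(theta[0] - 0.6)**2 + 0.05 * np.cos(8 * theta[0]))

rng = np.random.default_rng(0)
X = rng.uniform(0, 1, size=(3, 1))            # a few initial trials
y = np.array([reward(x) for x in X])
grid = np.linspace(0, 1, 200)[:, None]        # candidate policy parameters

for _ in range(7):                            # a handful of extra trials
    mu, std = gp_posterior(X, y, grid)
    ucb = mu + 2.0 * std                      # upper-confidence-bound acquisition
    x_next = grid[np.argmax(ucb)][None, :]    # most promising untried policy
    X = np.vstack([X, x_next])
    y = np.append(y, reward(x_next[0]))       # one real episode per iteration

best = X[np.argmax(y)][0]
print(f"best policy parameter after {len(y)} episodes: {best:.2f}")
```

Only ten episodes are "run on the robot" here; all other reward evaluations are posterior queries to the surrogate, which is the essence of the micro-data setting.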
A Skill Transfer Approach for Continuum Robots – Imitation of Octopus Reaching Motion with the STIFF-FLOP Robot
Malekzadeh, Milad S. (Istituto Italiano di Tecnologia (IIT)) | Calinon, Sylvain (Idiap Research Institute and Istituto Italiano di Tecnologia (IIT)) | Bruno, Danilo (Istituto Italiano di Tecnologia (IIT)) | Caldwell, Darwin G. (Istituto Italiano di Tecnologia (IIT))
Transferring skills to hyper-redundant systems requires the design of new motion-primitive representations that can cope with multiple sources of noise and redundancy, and that can dynamically handle perturbations in the environment. One route is to take inspiration from invertebrate systems in nature to search for new, versatile representations of motion/behavior primitives for continuum robots. In particular, the remarkably varied skills achieved by the octopus can guide us toward the design of such a robust encoding scheme. This abstract presents our ongoing work, which combines statistical machine learning, dynamical systems and stochastic optimization to study the problem of transferring skills to a flexible surgical robot (STIFF-FLOP) composed of two constant-curvature modules. The approach is tested in simulation by imitation and self-refinement of an octopus reaching motion.
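The kinematics of a two-module constant-curvature robot such as the one described above can be sketched with the standard arc parameterization (curvature, bending-plane angle, arc length), chaining one homogeneous transform per module. The numeric curvatures and lengths below are made-up illustration values, not STIFF-FLOP dimensions.

```python
import numpy as np

def rot_z(a):
    c, s = np.cos(a), np.sin(a)
    return np.array([[c, -s, 0], [s, c, 0], [0, 0, 1]])

def rot_y(a):
    c, s = np.cos(a), np.sin(a)
    return np.array([[c, 0, s], [0, 1, 0], [-s, 0, c]])

def segment_transform(kappa, phi, length):
    """Homogeneous transform of one constant-curvature segment,
    parameterized by curvature, bending-plane angle and arc length."""
    T = np.eye(4)
    if abs(kappa) < 1e-9:                      # straight segment
        T[2, 3] = length
        return T
    theta = kappa * length                     # total bending angle of the arc
    T[:3, :3] = rot_z(phi) @ rot_y(theta) @ rot_z(-phi)
    T[:3, 3] = [np.cos(phi) * (1 - np.cos(theta)) / kappa,
                np.sin(phi) * (1 - np.cos(theta)) / kappa,
                np.sin(theta) / kappa]
    return T

# Chain the two modules: tip pose = T1 @ T2.
T1 = segment_transform(kappa=4.0, phi=0.0, length=0.25)  # bends ~57 deg in x-z
T2 = segment_transform(kappa=0.0, phi=0.0, length=0.10)  # straight extension
tip = (T1 @ T2)[:3, 3]
print("tip position:", np.round(tip, 3))
```

A learned motion primitive would then output the per-module curvature parameters over time, with this forward model mapping them to task-space poses.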
Learning Collaborative Impedance-Based Robot Behaviors
Rozo, Leonel Dario (Istituto Italiano di Tecnologia) | Calinon, Sylvain (Istituto Italiano di Tecnologia) | Caldwell, Darwin (Istituto Italiano di Tecnologia) | Jimenez, Pablo (Researcher, Institut de Robotica i Informatica Industrial) | Torras, Carme (Institut de Robotica i Informatica Industrial)
Research in learning from demonstration has focused on transferring movements from humans to robots. However, a need is arising for robots that do not just replicate a task on their own, but that also interact with humans in a safe and natural way to accomplish tasks cooperatively. Robots with variable impedance capabilities open the door to new and challenging applications, where the learning algorithms must be extended to encapsulate force and vision information. In this paper we propose a framework for transferring impedance-based behaviors to a torque-controlled robot by kinesthetic teaching. The proposed model encodes the examples as a task-parameterized statistical dynamical system, where the robot impedance is shaped by estimating virtual stiffness matrices from the set of demonstrations. A collaborative assembly task is used as a testbed. The results show that the model can modify the robot impedance during task execution to facilitate the collaboration, triggering stiff and compliant behaviors in an on-line manner to adapt to the user's actions.
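In its simplest form, estimating a virtual stiffness matrix from demonstrations can be sketched as follows: if the robot is modeled as a spring, f = K(x_d - x), then K can be recovered from recorded forces and position errors by least squares. This is a deliberate simplification of the paper's task-parameterized statistical model, and the demonstration data below are synthetic stand-ins.

```python
import numpy as np

rng = np.random.default_rng(1)

# Synthetic stand-in for kinesthetic demonstrations: position errors
# (attractor minus actual position) and the forces sensed at those errors.
K_true = np.array([[120.0, 0.0],
                   [0.0,  40.0]])                     # stiff in x, compliant in y
err = rng.normal(0.0, 0.02, size=(200, 2))            # x_d - x  [m]
force = err @ K_true.T + rng.normal(0.0, 0.05, (200, 2))  # f = K(x_d - x) + noise

# Least-squares estimate of the virtual stiffness: solve err @ K.T ~= force.
K_est, *_ = np.linalg.lstsq(err, force, rcond=None)
K_est = K_est.T
print(np.round(K_est, 1))
```

The anisotropy of the recovered matrix (stiff along one axis, compliant along the other) is what lets the controller resist the user in directions that matter for the task while yielding elsewhere.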
Bayesian Nonparametric Multi-Optima Policy Search in Reinforcement Learning
Bruno, Danilo (Istituto Italiano di Tecnologia (IIT)) | Calinon, Sylvain (Istituto Italiano di Tecnologia (IIT)) | Caldwell, Darwin G. (Istituto Italiano di Tecnologia (IIT))
Skills can often be performed in many different ways. In order to provide robots with human-like adaptation capabilities, it is of great interest to learn several ways of achieving the same skill in parallel, since possible changes in the environment or in the robot can make some solutions infeasible. In this case, the knowledge of multiple solutions can avoid relearning the task. This paper addresses this problem within the framework of reinforcement learning, as the automatic determination of multiple optimal parameterized policies. For this purpose, a model handling a variable number of policies is built using a Bayesian non-parametric approach. The algorithm is first compared to single-policy algorithms on known benchmarks, and then applied to a typical robotic problem that admits multiple solutions.
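The key idea of letting the data decide how many distinct policies to keep can be illustrated with DP-means, a small-variance limit of the Dirichlet-process mixture; this is a simpler stand-in for the paper's Bayesian non-parametric model. Clusters of high-reward policy parameters are created on demand rather than being fixed in advance; the two-optima sample data below are hypothetical.

```python
import numpy as np

def dp_means(points, lam, iters=10):
    """DP-means clustering: the number of clusters is not fixed in advance;
    a new cluster is spawned whenever a point lies farther than lam from
    every existing centroid (small-variance limit of a DP mixture)."""
    centroids = [points[0].copy()]
    for _ in range(iters):
        labels = []
        for p in points:
            d = [np.linalg.norm(p - c) for c in centroids]
            if min(d) > lam:                   # no cluster explains p: add one
                centroids.append(p.copy())
                labels.append(len(centroids) - 1)
            else:
                labels.append(int(np.argmin(d)))
        labels = np.array(labels)
        centroids = [points[labels == k].mean(axis=0) if np.any(labels == k)
                     else c for k, c in enumerate(centroids)]
    return np.array(centroids)

# Hypothetical high-reward policy parameters sampled around two distinct
# optima of the same task (e.g., reaching around an obstacle on either side).
rng = np.random.default_rng(2)
samples = np.vstack([rng.normal([-1.0, 0.0], 0.1, (40, 2)),
                     rng.normal([+1.0, 0.0], 0.1, (40, 2))])
optima = dp_means(samples, lam=1.0)
print(len(optima), "distinct policies found")
```

Each resulting centroid can then seed its own parameterized policy, so that if one solution becomes infeasible, the robot can fall back on another without relearning.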