
Collaborating Authors: Schaal


Neural Dynamic Policies for End-to-End Sensorimotor Learning

Neural Information Processing Systems

The current dominant paradigm in sensorimotor control, whether imitation or reinforcement learning, is to train policies directly in raw action spaces such as torque, joint angle, or end-effector position. This forces the agent to make a decision at each point in training, and hence limits the scalability to continuous, high-dimensional, and long-horizon tasks. In contrast, research in classical robotics has, for a long time, exploited dynamical systems as a policy representation to learn robot behaviors via demonstrations.
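The dynamical-systems policy representation the abstract contrasts with raw action spaces is typically a dynamic movement primitive (DMP): a damped point attractor plus a phase-gated forcing term. A minimal sketch (forcing term set to zero for brevity; all gains and the `dmp_rollout` name are illustrative, not from the paper):

```python
import numpy as np

def dmp_rollout(x0, g, T=1.0, dt=0.001, alpha=25.0, beta=6.25, alpha_s=4.0):
    """Integrate a discrete dynamic movement primitive toward goal g.

    The canonical phase s decays from 1 to 0; a learned forcing term f(s)
    (zero here) is scaled by s, so as the phase vanishes the critically
    damped point attractor guarantees convergence to the goal.
    """
    x, v, s = float(x0), 0.0, 1.0
    traj = [x]
    for _ in range(int(T / dt)):
        f = 0.0  # a learned, phase-dependent forcing term would go here
        dv = alpha * (beta * (g - x) - v) + f   # point-attractor dynamics
        v += dv * dt / T
        x += v * dt / T
        s += -alpha_s * s * dt / T              # canonical (phase) system
        traj.append(x)
    return np.array(traj)
```

Because the output is a whole trajectory shaped by a few parameters, the agent no longer decides an action at every timestep, which is the scalability argument the abstract makes.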


Combining Movement Primitives with Contraction Theory

Nah, Moses C., Lachner, Johannes, Hogan, Neville, Slotine, Jean-Jacques

arXiv.org Artificial Intelligence

This paper presents a modular framework for motion planning using movement primitives. Central to the approach is Contraction Theory, a modular stability tool for nonlinear dynamical systems. The approach extends prior methods by achieving parallel and sequential combinations of both discrete and rhythmic movements, while enabling independent modulation of each movement. This modular framework enables a divide-and-conquer strategy to simplify the programming of complex robot motion planning. Simulation examples illustrate the flexibility and versatility of the framework, highlighting its potential to address diverse challenges in robot motion planning.
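The stability tool the abstract relies on has a simple sufficient condition: a system is contracting on a region if the symmetric part of its Jacobian is negative definite there, and contraction is preserved under the parallel and sequential combinations mentioned above. A numerical sanity check of that condition (the function name and sampling scheme are illustrative, not the paper's API):

```python
import numpy as np

def is_contracting(jacobian, samples):
    """Check the contraction condition at sampled states: the symmetric part
    of the Jacobian of x' = f(x) must be negative definite (identity metric)."""
    for x in samples:
        J = jacobian(x)
        sym = 0.5 * (J + J.T)
        if np.max(np.linalg.eigvalsh(sym)) >= 0.0:
            return False
    return True

# Example: a linear point attractor x' = -K x with K positive definite,
# the kind of primitive that can be safely combined with others.
K = np.array([[2.0, 0.5],
              [0.5, 1.0]])
jac = lambda x: -K
samples = [np.zeros(2), np.ones(2), np.array([-1.0, 2.0])]
```

For a linear system the Jacobian is constant, so one sample suffices; the sampled check matters for the nonlinear primitives the framework targets.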


USC and Max Planck: The Double Life of a Top Robotics Researcher

Der Spiegel International

When Stefan Kai Schaal decided he wanted to earn more money, he took a leave of absence. It took the researcher more than two years to integrate his new German life seamlessly and inconspicuously into his old American life. Schaal's employer, the University of Southern California (USC) in Los Angeles, was accommodating. It granted the renowned computer scientist the sabbatical in the middle of the semester - a sabbatical he had applied for on the day he was thrown out of his home and his wife filed for divorce after nine years of marriage. That was six years ago.


Learning to Select and Generalize Striking Movements in Robot Table Tennis

Muelling, Katharina (Max Planck Institute for Intelligent Systems) | Kober, Jens (Max Planck Institute for Intelligent Systems) | Kroemer, Oliver (Technische Universitaet Darmstadt) | Peters, Jan (Technische Universitaet Darmstadt)

AAAI Conferences

Learning new motor tasks autonomously from interaction with a human being is an important goal for both robotics and machine learning. However, when moving beyond basic skills, most monolithic machine learning approaches fail to scale. In this paper, we take the task of learning table tennis as an example and present a new framework which allows a robot to learn cooperative table tennis from interaction with a human. Therefore, the robot first learns a set of elementary table tennis hitting movements from a human teacher by kinesthetic teach-in, which is compiled into a set of dynamical system motor primitives (DMPs). Subsequently, the system generalizes these movements to a wider range of situations using our mixture of motor primitives (MoMP) approach. The resulting policy enables the robot to select appropriate motor primitives as well as to generalize between them. Finally, the robot plays with a human table tennis partner and learns online to improve its behavior.
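The mixture-of-motor-primitives (MoMP) idea in the abstract can be sketched as a gating network: each primitive proposes an action, and context-dependent gating weights blend and select among them. A minimal sketch (the Gaussian gatings, the ball-height context, and all names are hypothetical stand-ins, not the paper's implementation):

```python
import numpy as np

def momp_action(context, primitives, gatings):
    """Blend primitive outputs with normalized, context-dependent gating
    weights, as in a mixture-of-motor-primitives scheme."""
    w = np.array([g(context) for g in gatings], dtype=float)
    w /= w.sum()
    return w @ np.array([p(context) for p in primitives])

# Hypothetical example: two hitting primitives tuned to low vs. high balls,
# with Gaussian gatings over a scalar "incoming ball height" context.
prim_low  = lambda s: 0.0   # paddle angle learned for low balls
prim_high = lambda s: 1.0   # paddle angle learned for high balls
gate_low  = lambda s: np.exp(-0.5 * ((s - 0.0) / 0.2) ** 2)
gate_high = lambda s: np.exp(-0.5 * ((s - 1.0) / 0.2) ** 2)
```

Near a demonstrated situation one gate dominates (selection); between demonstrations the weights interpolate (generalization), which is the behavior the abstract describes.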


Training Wheels for the Robot: Learning from Demonstration Using Simulation

Koenig, Nathan (Open Source Robotics Foundation) | Matarić, Maja (University of Southern California)

AAAI Conferences

Learning from demonstration (LfD) is a promising technique for instructing/teaching autonomous systems based on demonstrations from people who may have little to no experience with robots. An important aspect to LfD is the communication method used to transfer knowledge from an instructor to a robot. The communication method affects the complexity of the demonstration process for instructors, the range of tasks a robot can learn, and the learning algorithm itself. We have designed a graphical interface and an instructional language to provide an intuitive teaching system. The drawback to simplifying the teaching interface is that the resulting demonstration data are less structured, adding complexity to the learning process. This additional complexity is handled through the combination of a minimal set of predefined behaviors and a task representation capable of learning probabilistic policies over a set of behaviors. The predefined behaviors consist of finite actions a robot can perform, which act as building blocks for more complex tasks.


Reinforcement Learning to Adjust Robot Movements to New Situations

Kober, Jens (Max Planck Institute for Intelligent Systems) | Oztop, Erhan (Advanced Telecommunications Research Institute) | Peters, Jan (Max Planck Institute for Intelligent Systems)

AAAI Conferences

Many complex robot motor skills can be represented using elementary movements, and there exist efficient techniques for learning parametrized motor plans using demonstrations and self-improvement. However, with current techniques the robot often needs to learn a new elementary movement even if a parametrized motor plan exists that covers a related situation. A method is needed that modulates the elementary movement through the meta-parameters of its representation. In this paper, we describe how to learn such mappings from circumstances to meta-parameters using reinforcement learning. In particular, we use a kernelized version of reward-weighted regression. We show two applications of the presented setup in robotic domains: the generalization of throwing movements in darts, and of hitting movements in table tennis. We demonstrate that both tasks can be learned successfully using simulated and real robots.
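The kernelized reward-weighted regression the abstract mentions amounts to a kernel-smoothed, reward-weighted average: past meta-parameters contribute to the prediction for a new situation in proportion to both their reward and their kernel similarity. A minimal sketch assuming a Gaussian kernel (the function name, bandwidth, and data layout are illustrative, not the paper's formulation):

```python
import numpy as np

def kernel_rwr(s_query, S, Theta, R, bw=0.3):
    """Predict a meta-parameter for situation s_query from past situations S
    (n x d), their meta-parameters Theta (n,), and rewards R (n,):
    a reward-weighted Nadaraya-Watson estimate with a Gaussian kernel."""
    d = np.linalg.norm(S - s_query, axis=1)      # distance to each past situation
    w = R * np.exp(-0.5 * (d / bw) ** 2)         # reward x kernel similarity
    return (w @ Theta) / w.sum()

# Toy data: two past situations with different successful meta-parameters.
S = np.array([[0.0], [1.0]])
Theta = np.array([0.0, 10.0])
R = np.ones(2)
```

Queries near a past situation reproduce its meta-parameter; queries in between interpolate, biased toward the higher-reward experiences.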



Nonparametric Model-Based Reinforcement Learning

Atkeson, Christopher G.

Neural Information Processing Systems

This paper describes some of the interactions of model learning algorithms and planning algorithms we have found in exploring model-based reinforcement learning. The paper focuses on how local trajectory optimizers can be used effectively with learned nonparametric models. We find that trajectory planners that are fully consistent with the learned model often have difficulty finding reasonable plans in the early stages of learning. Trajectory planners that balance obeying the learned model with minimizing cost (or maximizing reward) often do better, even if the plan is not fully consistent with the learned model.
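The abstract's key finding is that treating the learned model as a soft penalty, rather than a hard constraint, works better early in learning. One way to sketch that trade-off is an objective that sums task cost and a weighted model-inconsistency term, with the weight `lam` interpolating between pure cost minimization (lam = 0) and strict model consistency (lam large); the function and names below are an illustrative sketch, not the paper's planner:

```python
import numpy as np

def soft_consistency_objective(states, actions, model, cost, lam=1.0):
    """Trajectory objective balancing task cost against agreement with a
    learned dynamics model: J = sum_t cost(x_t, u_t)
                                + lam * ||x_{t+1} - model(x_t, u_t)||^2."""
    J = 0.0
    for t, u in enumerate(actions):
        J += cost(states[t], u)                                  # task cost
        pred = model(states[t], u)                               # model prediction
        J += lam * float(np.sum((states[t + 1] - pred) ** 2))    # soft consistency
    return J

# Toy 1-D setup: quadratic action cost, learned model x' = x + u.
model = lambda x, u: x + u
cost = lambda x, u: float(u @ u)
```

A planner minimizing this over `states` and `actions` jointly can propose trajectories that mildly disagree with an unreliable early model, which is the behavior the abstract reports as advantageous.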


