AITopics | Mansard, Nicolas

Collaborating Authors

Mansard, Nicolas

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Control of Humanoid Robots with Parallel Mechanisms using Kinematic Actuation Models

Lutz, Victor, de Matteïs, Ludovic, Batto, Virgile, Mansard, Nicolas

arXiv.org Artificial IntelligenceMar-28-2025

Inspired by the mechanical design of Cassie, several recently released humanoid robots are using actuator configuration in which the motor is displaced from the joint location to optimize the leg inertia. This in turn induces a non linearity in the reduction ratio of the transmission which is often neglected when computing the robot motion (e.g. by trajectory optimization or reinforcement learning) and only accounted for at control time. This paper proposes an analytical method to efficiently handle this non-linearity. Using this actuation model, we demonstrate that we can leverage the dynamic abilities of the non-linear transmission while only modeling the inertia of the main serial chain of the leg, without approximating the motor capabilities nor the joint range. Based on analytical inverse kinematics, our method does not need any numerical routines dedicated to the closed-kinematics actuation, hence leading to very efficient computations. Our study focuses on two mechanisms widely used in recent humanoid robots; the four bar knee linkage as well as a parallel 2 DoF ankle mechanism. We integrate these models inside optimization based (DDP) and learning (PPO) control approaches. A comparison of our model against a simplified model that completely neglects closed chains is then shown in simulation.

artificial intelligence, derivative, mechanism, (16 more...)

arXiv.org Artificial Intelligence

2503.22459

Country:

Europe > France (0.47)
North America > United States (0.46)

Genre: Research Report (0.40)

Industry: Energy (0.32)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback

Infinite-Horizon Value Function Approximation for Model Predictive Control

Jordana, Armand, Kleff, Sébastien, Haffemayer, Arthur, Ortiz-Haro, Joaquim, Carpentier, Justin, Mansard, Nicolas, Righetti, Ludovic

arXiv.org Artificial IntelligenceFeb-10-2025

Model Predictive Control has emerged as a popular tool for robots to generate complex motions. However, the real-time requirement has limited the use of hard constraints and large preview horizons, which are necessary to ensure safety and stability. In practice, practitioners have to carefully design cost functions that can imitate an infinite horizon formulation, which is tedious and often results in local minima. In this work, we study how to approximate the infinite horizon value function of constrained optimal control problems with neural networks using value iteration and trajectory optimization. Furthermore, we demonstrate how using this value function approximation as a terminal cost provides global stability to the model predictive controller. The approach is validated on two toy problems and a real-world scenario with online obstacle avoidance on an industrial manipulator where the value function is conditioned to the goal and obstacle.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

arXiv.org Artificial Intelligence

2502.0676

Country:

Europe > France (0.28)
North America > United States (0.28)

Genre: Research Report (0.82)

Industry: Energy > Oil & Gas > Upstream (0.62)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)
(3 more...)

Add feedback

Reinforcement Learning from Wild Animal Videos

Chane-Sane, Elliot, Roux, Constant, Stasse, Olivier, Mansard, Nicolas

arXiv.org Artificial IntelligenceDec-5-2024

We propose to learn legged robot locomotion skills by watching thousands of wild animal videos from the internet, such as those featured in nature documentaries. Indeed, such videos offer a rich and diverse collection of plausible motion examples, which could inform how robots should move. To achieve this, we introduce Reinforcement Learning from Wild Animal Videos (RLWAV), a method to ground these motions into physical robots. We first train a video classifier on a large-scale animal video dataset to recognize actions from RGB clips of animals in their natural habitats. We then train a multi-skill policy to control a robot in a physics simulator, using the classification score of a third-person camera capturing videos of the robot's movements as a reward for reinforcement learning. Finally, we directly transfer the learned policy to a real quadruped Solo. Remarkably, despite the extreme gap in both domain and embodiment between animals in the wild and robots, our approach enables the policy to learn diverse skills such as walking, jumping, and keeping still, without relying on reference trajectories nor skill-specific rewards.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

arXiv.org Artificial Intelligence

2412.04273

Country: Europe > France (0.14)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

SoloParkour: Constrained Reinforcement Learning for Visual Locomotion from Privileged Experience

Chane-Sane, Elliot, Amigo, Joseph, Flayols, Thomas, Righetti, Ludovic, Mansard, Nicolas

arXiv.org Artificial IntelligenceSep-20-2024

Parkour poses a significant challenge for legged robots, requiring navigation through complex environments with agility and precision based on limited sensory inputs. In this work, we introduce a novel method for training end-to-end visual policies, from depth pixels to robot control commands, to achieve agile and safe quadruped locomotion. We formulate robot parkour as a constrained reinforcement learning (RL) problem designed to maximize the emergence of agile skills within the robot's physical limits while ensuring safety. We first train a policy without vision using privileged information about the robot's surroundings. We then generate experience from this privileged policy to warm-start a sample efficient off-policy RL algorithm from depth images. This allows the robot to adapt behaviors from this privileged experience to visual locomotion while circumventing the high computational costs of RL directly from pixels. We demonstrate the effectiveness of our method on a real Solo-12 robot, showcasing its capability to perform a variety of parkour skills such as walking, climbing, leaping, and crawling.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

arXiv.org Artificial Intelligence

2409.13678

Country:

North America > United States (0.28)
Europe > France > Occitanie (0.14)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Parallel and Proximal Constrained Linear-Quadratic Methods for Real-Time Nonlinear MPC

Jallet, Wilson, Dantec, Ewen, Arlaud, Etienne, Carpentier, Justin, Mansard, Nicolas

arXiv.org Artificial IntelligenceJun-3-2024

Recent strides in nonlinear model predictive control (NMPC) underscore a dependence on numerical advancements to efficiently and accurately solve large-scale problems. Given the substantial number of variables characterizing typical whole-body optimal control (OC) problems - often numbering in the thousands - exploiting the sparse structure of the numerical problem becomes crucial to meet computational demands, typically in the range of a few milliseconds. Addressing the linear-quadratic regulator (LQR) problem is a fundamental building block for computing Newton or Sequential Quadratic Programming (SQP) steps in direct optimal control methods. This paper concentrates on equality-constrained problems featuring implicit system dynamics and dual regularization, a characteristic of advanced interiorpoint or augmented Lagrangian solvers. Here, we introduce a parallel algorithm for solving an LQR problem with dual regularization. Leveraging a rewriting of the LQR recursion through block elimination, we first enhanced the efficiency of the serial algorithm and then subsequently generalized it to handle parametric problems. This extension enables us to split decision variables and solve multiple subproblems concurrently. Our algorithm is implemented in our nonlinear numerical optimal control library ALIGATOR. It showcases improved performance over previous serial formulations and we validate its efficacy by deploying it in the model predictive control of a real quadruped robot.

algorithm, artificial intelligence, optimization problem, (17 more...)

arXiv.org Artificial Intelligence

2405.09197

Country:

Asia (0.93)
Europe > France (0.46)
North America > United States (0.28)

Genre: Research Report (0.50)

Industry:

Energy > Oil & Gas > Upstream (0.54)
Energy > Oil & Gas > Downstream (0.41)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)

Add feedback

CaT: Constraints as Terminations for Legged Locomotion Reinforcement Learning

Chane-Sane, Elliot, Leziart, Pierre-Alexandre, Flayols, Thomas, Stasse, Olivier, Souères, Philippe, Mansard, Nicolas

arXiv.org Artificial IntelligenceMar-27-2024

Deep Reinforcement Learning (RL) has demonstrated impressive results in solving complex robotic tasks such as quadruped locomotion. Yet, current solvers fail to produce efficient policies respecting hard constraints. In this work, we advocate for integrating constraints into robot learning and present Constraints as Terminations (CaT), a novel constrained RL algorithm. Departing from classical constrained RL formulations, we reformulate constraints through stochastic terminations during policy learning: any violation of a constraint triggers a probability of terminating potential future rewards the RL agent could attain. We propose an algorithmic approach to this formulation, by minimally modifying widely used off-the-shelf RL algorithms in robot learning (such as Proximal Policy Optimization). Our approach leads to excellent constraint adherence without introducing undue complexity and computational overhead, thus mitigating barriers to broader adoption. Through empirical evaluation on the real quadruped robot Solo crossing challenging obstacles, we demonstrate that CaT provides a compelling solution for incorporating constraints into RL frameworks. Videos and code are available at https://constraints-as-terminations.github.io.

constraint, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

2403.18765

Country: Europe > France > Occitanie (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Robots > Locomotion (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Visually Guided Model Predictive Robot Control via 6D Object Pose Localization and Tracking

Fourmy, Mederic, Priban, Vojtech, Behrens, Jan Kristof, Mansard, Nicolas, Sivic, Josef, Petrik, Vladimir

arXiv.org Artificial IntelligenceNov-9-2023

The objective of this work is to enable manipulation tasks with respect to the 6D pose of a dynamically moving object using a camera mounted on a robot. Examples include maintaining a constant relative 6D pose of the robot arm with respect to the object, grasping the dynamically moving object, or co-manipulating the object together with a human. Fast and accurate 6D pose estimation is crucial to achieve smooth and stable robot control in such situations. The contributions of this work are three fold. First, we propose a new visual perception module that asynchronously combines accurate learning-based 6D object pose localizer and a high-rate model-based 6D pose tracker. The outcome is a low-latency accurate and temporally consistent 6D object pose estimation from the input video stream at up to 120 Hz. Second, we develop a visually guided robot arm controller that combines the new visual perception module with a torque-based model predictive control algorithm. Asynchronous combination of the visual and robot proprioception signals at their corresponding frequencies results in stable and robust 6D object pose guided robot arm control. Third, we experimentally validate the proposed approach on a challenging 6D pose estimation benchmark and demonstrate 6D object pose-guided control with dynamically moving objects on a real 7 DoF Franka Emika Panda robot.

artificial intelligence, tracker, video understanding, (16 more...)

arXiv.org Artificial Intelligence

2311.05344

Country: Europe (0.68)

Genre: Research Report (0.50)

Industry:

Government (0.48)
Energy > Oil & Gas (0.37)

Technology:

Information Technology > Artificial Intelligence > Vision > Video Understanding (0.77)
Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.75)

Add feedback

Perceptive Locomotion through Whole-Body MPC and Optimal Region Selection

Corbères, Thomas, Mastalli, Carlos, Merkt, Wolfgang, Havoutis, Ioannis, Fallon, Maurice, Mansard, Nicolas, Flayols, Thomas, Vijayakumar, Sethu, Tonneau, Steve

arXiv.org Artificial IntelligenceMay-15-2023

Abstract--Real-time synthesis of legged locomotion maneuvers in challenging industrial settings is still an open problem, requiring simultaneous determination of footsteps locations several steps ahead while generating whole-body motions close to the robot's limits. State estimation and perception errors impose the practical constraint of fast re-planning motions in a model predictive control (MPC) framework. We first observe that the computational limitation of perceptive locomotion pipelines lies in the combinatorics of contact surface selection. Re-planning contact locations on selected surfaces can be accomplished at MPC frequencies (50-100 Hz). Then, whole-body motion generation typically follows a reference trajectory for the robot base to facilitate convergence. Our contributions are integrated into a complete framework for perceptive locomotion, validated under diverse terrain conditions, and demonstrated in challenging trials that push the robot's actuation limits, as well as in the ICRA 2023 quadruped challenge simulation. ELIABLE and autonomous locomotion for legged robots in arbitrary environments is a longstanding challenge. A. State of the art The hardware maturity of quadruped robots [1], [2], [3], [4] The mathematical complexity of the legged locomotion motivates the development of a motion synthesis framework problem in arbitrary environments is such that an undesired for applications including inspections in industrial areas [5]. Typically, a contact plan describing the contact handling the issues of contact decision (where should the robot locations is first computed, assumed to be feasible, and provided step?) and Whole-Body Model Predictive Control (WB-MPC) as input to a WB-MPC framework to generate wholebody of the robot (what motion creates the contact?). As a result, the contact decision Each contact decision defines high-dimensional, non-linear must be made using an approximated robot model, under the geometric and dynamic constraints on the WB-MPC that uncertainty that results from imperfect perception and state prevent a trivial decoupling of the two issues: How to prove estimation. The complexity of the approximated model has, that a contact plan is valid without finding a feasible wholebody unsurprisingly, a strong correlation with the versatility and motion to achieve it?

artificial intelligence, robot, trajectory, (17 more...)

arXiv.org Artificial Intelligence

2305.08926

Country:

Europe (1.00)
North America > United States > California (0.28)

Genre: Research Report (0.64)

Industry: Energy > Oil & Gas > Downstream (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Locomotion (1.00)

Add feedback

Optimization-Based Control for Dynamic Legged Robots

Wensing, Patrick M., Posa, Michael, Hu, Yue, Escande, Adrien, Mansard, Nicolas, Del Prete, Andrea

arXiv.org Artificial IntelligenceNov-21-2022

In a world designed for legs, quadrupeds, bipeds, and humanoids have the opportunity to impact emerging robotics applications from logistics, to agriculture, to home assistance. The goal of this survey is to cover the recent progress toward these applications that has been driven by model-based optimization for the real-time generation and control of movement. The majority of the research community has converged on the idea of generating locomotion control laws by solving an optimal control problem (OCP) in either a model-based or data-driven manner. However, solving the most general of these problems online remains intractable due to complexities from intermittent unidirectional contacts with the environment, and from the many degrees of freedom of legged robots. This survey covers methods that have been pursued to make these OCPs computationally tractable, with specific focus on how environmental contacts are treated, how the model can be simplified, and how these choices affect the numerical solution methods employed. The survey focuses on model-based optimization, covering its recent use in a stand alone fashion, and suggesting avenues for combination with learning-based formulations to further accelerate progress in this growing field.

artificial intelligence, robot, survey article, (16 more...)

arXiv.org Artificial Intelligence

2211.11644

Country:

Europe (0.92)
North America > United States (0.45)

Genre: Overview (1.00)

Industry: Energy > Oil & Gas (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots > Locomotion (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)

Add feedback

Constrained Differential Dynamic Programming: A primal-dual augmented Lagrangian approach

Jallet, Wilson, Bambade, Antoine, Mansard, Nicolas, Carpentier, Justin

arXiv.org Artificial IntelligenceOct-28-2022

Trajectory optimization is an efficient approach for solving optimal control problems for complex robotic systems. It relies on two key components: first the transcription into a sparse nonlinear program, and second the corresponding solver to iteratively compute its solution. On one hand, differential dynamic programming (DDP) provides an efficient approach to transcribe the optimal control problem into a finite-dimensional problem while optimally exploiting the sparsity induced by time. On the other hand, augmented Lagrangian methods make it possible to formulate efficient algorithms with advanced constraint-satisfaction strategies. In this paper, we propose to combine these two approaches into an efficient optimal control algorithm accepting both equality and inequality constraints. Based on the augmented Lagrangian literature, we first derive a generic primal-dual augmented Lagrangian strategy for nonlinear problems with equality and inequality constraints. We then apply it to the dynamic programming principle to solve the value-greedy optimization problems inherent to the backward pass of DDP, which we combine with a dedicated globalization strategy, resulting in a Newton-like algorithm for solving constrained trajectory optimization problems. Contrary to previous attempts of formulating an augmented Lagrangian version of DDP, our approach exhibits adequate convergence properties without any switch in strategies. We empirically demonstrate its interest with several case-studies from the robotics literature.

artificial intelligence, constraint, optimization problem, (13 more...)

arXiv.org Artificial Intelligence

2210.15409

Country: Europe > France (0.46)

Genre: Research Report (0.40)

Industry: Energy (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)

Add feedback