
Collaborating Authors: Stasse, Olivier


Reinforcement Learning from Wild Animal Videos

arXiv.org Artificial Intelligence

We propose to learn legged robot locomotion skills by watching thousands of wild animal videos from the internet, such as those featured in nature documentaries. Indeed, such videos offer a rich and diverse collection of plausible motion examples, which could inform how robots should move. To achieve this, we introduce Reinforcement Learning from Wild Animal Videos (RLWAV), a method to ground these motions into physical robots. We first train a video classifier on a large-scale animal video dataset to recognize actions from RGB clips of animals in their natural habitats. We then train a multi-skill policy to control a robot in a physics simulator, using the classification score of a third-person camera capturing videos of the robot's movements as a reward for reinforcement learning. Finally, we directly transfer the learned policy to a real quadruped Solo. Remarkably, despite the extreme gap in both domain and embodiment between animals in the wild and robots, our approach enables the policy to learn diverse skills such as walking, jumping, and keeping still, without relying on reference trajectories or skill-specific rewards.
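The key mechanism here is using a frozen video classifier's score as the RL reward. Below is a minimal sketch of that reward computation; the classifier architecture, clip dimensions, and all names (`DummyVideoClassifier`, `skill_reward`) are hypothetical stand-ins for the classifier the paper trains on animal videos, not the authors' implementation.

```python
import torch
import torch.nn as nn

class DummyVideoClassifier(nn.Module):
    """Stand-in for the frozen animal-action classifier (hypothetical)."""
    def __init__(self, num_classes=10):
        super().__init__()
        self.head = nn.Linear(3 * 16 * 64 * 64, num_classes)

    def forward(self, clip):              # clip: (B, T, C, H, W)
        return self.head(clip.flatten(1)) # logits over action classes

@torch.no_grad()
def skill_reward(classifier, frames, skill_idx):
    """Reward = classifier score of the commanded skill for the robot clip.

    frames: (T, C, H, W) tensor of rendered third-person camera images.
    skill_idx: index of the commanded skill (e.g. 'walking', 'jumping').
    """
    clip = frames.unsqueeze(0)            # add batch dim -> (1, T, C, H, W)
    probs = torch.softmax(classifier(clip), dim=-1)
    return probs[0, skill_idx].item()     # higher when the clip looks like the skill

classifier = DummyVideoClassifier().eval()
frames = torch.rand(16, 3, 64, 64)        # one 16-frame clip rendered in simulation
print(skill_reward(classifier, frames, skill_idx=2))
```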


Whole-body MPC and sensitivity analysis of a real time foot step sequencer for a biped robot Bolt

arXiv.org Artificial Intelligence

This paper presents a novel controller for the bipedal robot Bolt. Our approach leverages a whole-body model predictive controller in conjunction with a footstep sequencer to achieve robust locomotion. Simulation results demonstrate effective velocity tracking as well as push and slippage recovery abilities. In addition, we provide a theoretical sensitivity analysis of the footstep sequencing problem to enhance the understanding of the results. Bipedal robotics, with origins tracing back to the end of the last century, has witnessed a significant surge in recent years.
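To illustrate how a footstep sequencer feeds a whole-body MPC, here is a schematic control-loop sketch. It uses a generic capture-point-style heuristic on a linear inverted pendulum as the sequencer; this is an illustrative placeholder, not the paper's sequencer, and all names and parameter values are assumptions.

```python
import numpy as np

def next_footstep(com_pos, com_vel, v_ref, step_time, omega):
    """Capture-point style footstep heuristic (illustrative, not the paper's).

    Places the foot near the divergent component of motion so the pendulum
    comes to rest, plus an offset to track the commanded velocity.
    """
    dcm = com_pos + com_vel / omega   # divergent component of motion
    return dcm + v_ref * step_time    # shift to track the reference velocity

# One tick of the loop at the sequencer rate:
omega = np.sqrt(9.81 / 0.35)          # LIP natural frequency, assumed 0.35 m CoM height
com_pos = np.array([0.0, 0.05])
com_vel = np.array([0.3, 0.0])
v_ref = np.array([0.5, 0.0])          # commanded planar velocity
target = next_footstep(com_pos, com_vel, v_ref, step_time=0.25, omega=omega)
print("next footstep target:", target)
# A whole-body MPC would then optimise joint torques over a receding horizon
# to realise this contact plan under the robot's full dynamics.
```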


NAS: N-step computation of All Solutions to the footstep planning problem

arXiv.org Artificial Intelligence

How many ways are there to climb a staircase in a given number of steps? Infinitely many, if we focus on the continuous aspect of the problem. A finite, possibly large number if we consider the discrete aspect, i.e., which effectors step on which surfaces, and in what order. We introduce NAS, an algorithm that considers both aspects simultaneously and computes all the possible solutions to such a contact planning problem, under standard assumptions. To our knowledge, NAS is the first algorithm to produce a globally optimal policy, efficiently queried in real time for planning the next footsteps of a humanoid robot. Our empirical results (in simulation and on the Talos platform) demonstrate that, despite the theoretical exponential complexity, optimisations reduce the practical complexity of NAS to a manageable bilinear form, maintaining completeness guarantees and enabling efficient GPU parallelisation. NAS is demonstrated in a variety of scenarios for the Talos robot, both in simulation and on the hardware platform. Future work will focus on further reducing computation times and extending the algorithm's applicability beyond gaited locomotion. Our companion video is available at https://youtu.be/Shkf8PyDg4g
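The discrete side of this problem, enumerating every n-step contact sequence, admits a layer-by-layer dynamic program. The sketch below shows that idea on a toy transition table; the state encoding, the `transitions` relation, and the function names are illustrative assumptions, not the paper's formulation.

```python
from collections import defaultdict

# state = (surface_id, stepping effector); transitions[state] lists the
# feasible next contacts. This toy table is NOT the paper's model.
transitions = {
    ("s0", "L"): [("s1", "R")],
    ("s1", "R"): [("s1", "L"), ("s2", "L")],
    ("s1", "L"): [("s2", "R")],
    ("s2", "L"): [("s2", "R")],
    ("s2", "R"): [],
}

def all_solutions(start, n):
    """Collect all n-step contact sequences reachable from `start`.

    Tabulating layer by layer records, for every reachable state, all paths
    leading to it; such a table acts as a policy that can be queried in
    real time for the next feasible footsteps.
    """
    layers = [{start: [[start]]}]
    for _ in range(n):
        nxt = defaultdict(list)
        for state, paths in layers[-1].items():
            for succ in transitions.get(state, []):
                nxt[succ].extend(p + [succ] for p in paths)
        layers.append(dict(nxt))
    return [p for paths in layers[-1].values() for p in paths]

for seq in all_solutions(("s0", "L"), n=3):
    print(seq)
```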


CaT: Constraints as Terminations for Legged Locomotion Reinforcement Learning

arXiv.org Artificial Intelligence

Deep Reinforcement Learning (RL) has demonstrated impressive results in solving complex robotic tasks such as quadruped locomotion. Yet, current solvers fail to produce efficient policies respecting hard constraints. In this work, we advocate for integrating constraints into robot learning and present Constraints as Terminations (CaT), a novel constrained RL algorithm. Departing from classical constrained RL formulations, we reformulate constraints through stochastic terminations during policy learning: any violation of a constraint triggers a probability of terminating potential future rewards the RL agent could attain. We propose an algorithmic approach to this formulation, by minimally modifying widely used off-the-shelf RL algorithms in robot learning (such as Proximal Policy Optimization). Our approach leads to excellent constraint adherence without introducing undue complexity or computational overhead, thus mitigating barriers to broader adoption. Through empirical evaluation on the real quadruped robot Solo crossing challenging obstacles, we demonstrate that CaT provides a compelling solution for incorporating constraints into RL frameworks. Videos and code are available at https://constraints-as-terminations.github.io.
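The core trick, turning constraint violations into a probability of early termination, touches only the environment step seen by the RL algorithm. Here is a minimal sketch of that post-processing; the mapping from violation magnitude to termination probability is a simplified guess, not the paper's exact schedule, and all function names are hypothetical.

```python
import numpy as np

def termination_probability(violations, max_prob=0.95):
    """Map constraint values (<= 0 means satisfied) to a termination probability.

    violations: array of c_i(s, a); positive entries indicate violation.
    The worst relative violation is smoothly scaled into [0, max_prob).
    """
    v = np.clip(violations, 0.0, None)
    return max_prob * np.max(v / (v + 1.0))

def cat_step_postprocess(reward, done, violations, rng):
    """Apply stochastic termination on top of an off-the-shelf RL step.

    With probability p the episode terminates early, which PPO's
    bootstrapped return sees as losing all future rewards.
    """
    p = termination_probability(violations)
    if rng.random() < p:
        done = True   # cut the return: no bootstrapped future value
    return reward, done

rng = np.random.default_rng(0)
violations = np.array([-0.2, 0.3, 0.0])   # one constraint violated
print(cat_step_postprocess(1.0, False, violations, rng))
```

Because only the `done` signal changes, the underlying PPO update is untouched, which matches the abstract's claim of minimally modifying off-the-shelf RL algorithms.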