AITopics | mc-pilco

Collaborating Authors

mc-pilco

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Accelerating Model-Based Reinforcement Learning using Non-Linear Trajectory Optimization

Calì, Marco, Giacomuzzo, Giulio, Carli, Ruggero, Libera, Alberto Dalla

arXiv.org Artificial IntelligenceJun-4-2025

This paper addresses the slow policy optimization convergence of Monte Carlo Probabilistic Inference for Learning Control (MC-PILCO), a state-of-the-art model-based reinforcement learning (MBRL) algorithm, by integrating it with iterative Linear Quadratic Regulator (iLQR), a fast trajectory optimization method suitable for nonlinear systems. The proposed method, Exploration-Boosted MC-PILCO (EB-MC-PILCO), leverages iLQR to generate informative, exploratory trajectories and initialize the policy, significantly reducing the number of required optimization steps. Experiments on the cart-pole task demonstrate that EB-MC-PILCO accelerates convergence compared to standard MC-PILCO, achieving up to $\bm{45.9\%}$ reduction in execution time when both methods solve the task in four trials. EB-MC-PILCO also maintains a $\bm{100\%}$ success rate across trials while solving the task faster, even in cases where MC-PILCO converges in fewer iterations.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Artificial Intelligence

2506.02767

Country: Europe (0.28)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.61)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.48)

Add feedback

Learning global control of underactuated systems with Model-Based Reinforcement Learning

Turcato, Niccolò, Calì, Marco, Libera, Alberto Dalla, Giacomuzzo, Giulio, Carli, Ruggero, Romeres, Diego

arXiv.org Artificial IntelligenceApr-10-2025

Learning global control of underactuated systems with Model-Based Reinforcement Learning Niccol ` o Turcato 1, Marco Cal ` ı 1, Alberto Dalla Libera 1, Giulio Giacomuzzo 1, Ruggero Carli 1 and Diego Romeres 2 Abstract -- This short paper describes our proposed solution for the third edition of the "AI Olympics with RealAIGym" competition, held at ICRA 2025. We employed Monte-Carlo Probabilistic Inference for Learning Control (MC-PILCO), an MBRL algorithm recognized for its exceptional data efficiency across various low-dimensional robotic tasks, including cart-pole, ball & plate, and Furuta pendulum systems. This approach has proven highly effective in physical systems, offering greater data efficiency than Model-Free (MF) alternatives. Notably, MC-PILCO has previously won the first two editions of this competition, demonstrating its robustness in both simulated and real-world environments. Besides briefly reviewing the algorithm, we discuss the most critical aspects of the MC-PILCO implementation in the tasks at hand: learning a global policy for the pendubot and acrobot systems.

controller, machine learning, reinforcement learning, (16 more...)

arXiv.org Artificial Intelligence

2504.06721

Country: Europe (0.47)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.71)

Add feedback

Reinforcement Learning for Robust Athletic Intelligence: Lessons from the 2nd 'AI Olympics with RealAIGym' Competition

Wiebe, Felix, Turcato, Niccolò, Libera, Alberto Dalla, Choe, Jean Seong Bjorn, Choi, Bumkyu, Faust, Tim Lukas, Maraqten, Habib, Aghadavoodi, Erfan, Cali, Marco, Sinigaglia, Alberto, Giacomuzzo, Giulio, Romeres, Diego, Kim, Jong-kook, Susto, Gian Antonio, Vyas, Shubham, Mronga, Dennis, Belousov, Boris, Peters, Jan, Kirchner, Frank, Kumar, Shivesh

arXiv.org Artificial IntelligenceMar-19-2025

In the field of robotics many different approaches ranging from classical planning over optimal control to reinforcement learning (RL) are developed and borrowed from other fields to achieve reliable control in diverse tasks. In order to get a clear understanding of their individual strengths and weaknesses and their applicability in real world robotic scenarios is it important to benchmark and compare their performances not only in a simulation but also on real hardware. The '2nd AI Olympics with RealAIGym' competition was held at the IROS 2024 conference to contribute to this cause and evaluate different controllers according to their ability to solve a dynamic control problem on an underactuated double pendulum system with chaotic dynamics. This paper describes the four different RL methods submitted by the participating teams, presents their performance in the swing-up task on a real double pendulum, measured against various criteria, and discusses their transferability from simulation to real hardware and their robustness to external disturbances.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Artificial Intelligence

2503.1529

Country:

Europe > Germany > Bremen > Bremen (0.14)
Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.05)
North America > United States (0.04)
(4 more...)

Genre:

Research Report (0.64)
Workflow (0.46)

Industry:

Leisure & Entertainment (0.46)
Government > Regional Government (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.68)

Add feedback

Learning control of underactuated double pendulum with Model-Based Reinforcement Learning

Turcato, Niccolò, Libera, Alberto Dalla, Giacomuzzo, Giulio, Carli, Ruggero, Romeres, Diego

arXiv.org Artificial IntelligenceSep-9-2024

This report describes our proposed solution for the second AI Olympics competition held at IROS 2024. Our solution is based on a recent Model-Based Reinforcement Learning algorithm named MC-PILCO. Besides briefly reviewing the algorithm, we discuss the most critical aspects of the MC-PILCO implementation in the tasks at hand.

algorithm, controller, mc-pilco, (13 more...)

arXiv.org Artificial Intelligence

2409.05811

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Italy (0.04)
Europe > Germany > Bremen > Bremen (0.04)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Model-Based Policy Search Using Monte Carlo Gradient Estimation with Real Systems Application

Amadio, Fabio, Libera, Alberto Dalla, Antonello, Riccardo, Nikovski, Daniel, Carli, Ruggero, Romeres, Diego

arXiv.org Artificial IntelligenceSep-6-2022

In this paper, we present a Model-Based Reinforcement Learning (MBRL) algorithm named \emph{Monte Carlo Probabilistic Inference for Learning COntrol} (MC-PILCO). The algorithm relies on Gaussian Processes (GPs) to model the system dynamics and on a Monte Carlo approach to estimate the policy gradient. This defines a framework in which we ablate the choice of the following components: (i) the selection of the cost function, (ii) the optimization of policies using dropout, (iii) an improved data efficiency through the use of structured kernels in the GP models. The combination of the aforementioned aspects affects dramatically the performance of MC-PILCO. Numerical comparisons in a simulated cart-pole environment show that MC-PILCO exhibits better data efficiency and control performance w.r.t. state-of-the-art GP-based MBRL algorithms. Finally, we apply MC-PILCO to real systems, considering in particular systems with partially measurable states. We discuss the importance of modeling both the measurement system and the state estimators during policy optimization. The effectiveness of the proposed solutions has been tested in simulation and on two real systems, a Furuta pendulum and a ball-and-plate rig.

artificial intelligence, machine learning, mc-pilco, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/TRO.2022.3184837

2101.12115

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Florida > Broward County > Fort Lauderdale (0.04)
North America > United States > California (0.04)
(2 more...)

Genre:

Research Report > Experimental Study (0.47)
Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.87)
(2 more...)

Add feedback