Learning control of underactuated double pendulum with Model-Based Reinforcement Learning

Turcato, Niccolò, Libera, Alberto Dalla, Giacomuzzo, Giulio, Carli, Ruggero, Romeres, Diego

arXiv.org Artificial Intelligence 

This report describes our proposed solution for the second AI Olympics competition held at IROS 2024. Our solution is based on a recent Model-Based Reinforcement Learning algorithm named MC-PILCO. Besides briefly reviewing the algorithm, we discuss the most critical aspects of the MC-PILCO implementation in the tasks at hand.