Velocity-History-Based Soft Actor-Critic Tackling IROS'24 Competition "AI Olympics with RealAIGym"
Faust, Tim Lukas, Maraqten, Habib, Aghadavoodi, Erfan, Belousov, Boris, Peters, Jan
–arXiv.org Artificial Intelligence
The ``AI Olympics with RealAIGym'' competition challenges participants to stabilize chaotic underactuated dynamical systems with advanced control algorithms. In this paper, we present a novel solution submitted to IROS'24 competition, which builds upon Soft Actor-Critic (SAC), a popular model-free entropy-regularized Reinforcement Learning (RL) algorithm. We add a `context' vector to the state, which encodes the immediate history via a Convolutional Neural Network (CNN) to counteract the unmodeled effects on the real system. Our method achieves high performance scores and competitive robustness scores on both tracks of the competition: Pendubot and Acrobot.
arXiv.org Artificial Intelligence
Oct-26-2024
- Country:
- Asia > Russia (0.04)
- Europe
- Germany > Hesse
- Darmstadt Region > Darmstadt (0.05)
- Russia (0.04)
- Germany > Hesse
- North America > United States
- New York (0.04)
- Genre:
- Research Report > Promising Solution (0.34)
- Technology: