Quantum deep reinforcement learning for humanoid robot navigation task
Lokossou, Romerik, Girma, Birhanu Shimelis, Tonguz, Ozan K., Biyabani, Ahmed
–arXiv.org Artificial Intelligence
Abstract--Classical reinforcement learning (RL) methods often struggle in complex, high-dimensional environments because of their extensive parameter requirements and challenges posed by stochastic, non-deterministic settings. This study introduces quantum deep reinforcement learning (QDRL) to train humanoid agents efficiently. While previous quantum RL models focused on smaller environments, such as wheeled robots and robotic arms, our work pioneers the application of QDRL to humanoid robotics, specifically in environments with substantial observation and action spaces, such as MuJoCo's Humanoid-v4 and Walker2d-v4. Using parameterized quantum circuits, we explored a hybrid quantum-classical setup to directly navigate high-dimensional state spaces, bypassing traditional mapping and planning. By integrating quantum computing with deep RL, we aim to develop models that can efficiently learn complex navigation tasks in humanoid robots. We evaluated the performance of the Soft Actor-Critic (SAC) in classical RL against its quantum implementation. The results show that the quantum SAC achieves an 8% higher average return (246.40)
arXiv.org Artificial Intelligence
Sep-16-2025
- Country:
- Africa (0.04)
- North America > United States
- Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Genre:
- Research Report (0.70)
- Technology: