Quantum Reinforcement Learning by Adaptive Non-local Observables
Lin, Hsin-Yi, Chen, Samuel Yen-Chi, Tseng, Huan-Hsin, Yoo, Shinjae
–arXiv.org Artificial Intelligence
Hybrid quantum-classical frameworks leverage quantum computing for machine learning; however, variational quantum circuits (VQCs) are limited by the need for local measurements. We introduce an adaptive non-local observable (ANO) paradigm within VQCs for quantum reinforcement learning (QRL), jointly optimizing circuit parameters and multi-qubit measurements. The ANO-VQC architecture serves as the function approximator in Deep Q-Network (DQN) and Asynchronous Advantage Actor-Critic (A3C) algorithms. On multiple benchmark tasks, ANO-VQC agents outperform baseline VQCs. Ablation studies reveal that adaptive measurements enhance the function space without increasing circuit depth. Our results demonstrate that adaptive multi-qubit observables can enable practical quantum advantages in reinforcement learning.
arXiv.org Artificial Intelligence
Jul-29-2025
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- New Jersey > Essex County
- Orange (0.04)
- South Orange (0.04)
- New Jersey > Essex County
- Europe > United Kingdom
- Genre:
- Research Report > New Finding (1.00)
- Technology: