Reviews: Robust exploration in linear quadratic reinforcement learning
Neural Information Processing Systems
The paper presents a new technique for robust optimization and balanced exploration in LQR problems. The technique is quite innovative since it leverages semidefinite programming instead of dynamic programming. This is an important algorithmic contribution with solid theory. For the empirical evaluation, the authors are expected to include the new experiments and running times mentioned in the rebuttal. Overall, this is very nice work.
Jan-21-2025, 08:11:26 GMT