Robust exploration in linear quadratic reinforcement learning

Jack Umenberger, Mina Ferizbegovic, Thomas B. Schön, Håkan Hjalmarsson

Jan-21-2025, 08:11:36 GMT–Neural Information Processing Systems

This paper concerns the problem of learning control policies for an unknown linear dynamical system to minimize a quadratic cost function. We present a method, based on convex optimization, that accomplishes this task robustly, i.e., we minimize the worst-case cost, accounting for system uncertainty given the observed data. The method balances exploitation and exploration, exciting the system so as to reduce uncertainty in the model parameters to which the worst-case cost is most sensitive. Numerical simulations and application to a hardware-in-the-loop servo-mechanism demonstrate the approach, with appreciable performance and robustness gains over alternative methods observed in both.
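
To make the worst-case objective concrete, here is a minimal sketch. It is not the authors' convex-optimization method. It computes a certainty-equivalent LQR gain for an estimated model and then evaluates that gain's average quadratic cost on a handful of randomly perturbed models, taking the maximum as a crude stand-in for the worst-case cost. The matrices `A_hat`, `B_hat`, `Q`, `R`, the perturbation size, and the helper names are all hypothetical choices made for illustration.

```python
import numpy as np

def lqr_gain(A, B, Q, R, iters=1000):
    """Discrete-time LQR gain K (u = K x) via Riccati value iteration."""
    P = Q.copy()
    for _ in range(iters):
        K = -np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ A + A.T @ P @ B @ K
    return K

def closed_loop_cost(A, B, K, Q, R, iters=2000):
    """Approximate average cost of u = K x under unit-covariance process noise.

    Returns inf when the closed loop A + B K is not stable.
    """
    Acl = A + B @ K
    if np.max(np.abs(np.linalg.eigvals(Acl))) >= 1.0:
        return np.inf
    S = Q + K.T @ R @ K
    P = np.zeros_like(Q)
    for _ in range(iters):  # fixed-point iteration on the Lyapunov equation
        P = S + Acl.T @ P @ Acl
    return float(np.trace(P))

# Hypothetical estimated model and cost weights (assumptions, not from the paper).
A_hat = np.array([[0.9, 0.2], [0.0, 0.8]])
B_hat = np.array([[0.0], [1.0]])
Q, R = np.eye(2), np.eye(1)

K = lqr_gain(A_hat, B_hat, Q, R)
nominal = closed_loop_cost(A_hat, B_hat, K, Q, R)

# Crude uncertainty set: the nominal model plus random perturbations of it.
rng = np.random.default_rng(0)
models = [(A_hat, B_hat)] + [
    (A_hat + 0.05 * rng.standard_normal((2, 2)),
     B_hat + 0.05 * rng.standard_normal((2, 1)))
    for _ in range(50)
]
worst = max(closed_loop_cost(A, B, K, Q, R) for A, B in models)

print("nominal cost:", nominal)
print("worst sampled cost:", worst)
```

The gap between the two printed values indicates how much a policy designed for the estimated model alone can degrade under model error, which is the quantity a robust method of the kind described above aims to control.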

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Country:
- Europe > Sweden (0.14)
- North America > Canada (0.14)

Industry:
- Energy > Oil & Gas (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (0.65)
  - Representation & Reasoning (1.00)