Policy Optimization with Linear Temporal Logic Constraints
Voloshin, Cameron, Le, Hoang M., Chaudhuri, Swarat, Yue, Yisong
–arXiv.org Artificial Intelligence
We study the problem of policy optimization (PO) with linear temporal logic (LTL) constraints. The language of LTL allows flexible description of tasks that may be unnatural to encode as a scalar cost function. We consider LTL-constrained PO as a systematic framework, decoupling task specification from policy selection, and as an alternative to the standard of cost shaping. With access to a generative model, we develop a model-based approach that enjoys a sample complexity analysis for guaranteeing both task satisfaction and cost optimality (through a reduction to a reachability problem). Empirically, our algorithm can achieve strong performance even in low-sample regimes.
arXiv.org Artificial Intelligence
Oct-19-2022
- Country:
- North America > United States
- Michigan (0.04)
- Virginia > Arlington County
- Arlington (0.04)
- Massachusetts > Middlesex County
- Florida > Palm Beach County
- Boca Raton (0.04)
- California > Alameda County
- Berkeley (0.04)
- North America > United States
- Genre:
- Research Report (0.63)
- Overview (0.45)