SafePILCO: a software tool for safe and data-efficient policy synthesis
Kyriakos Polymenakos, Nikitas Rontsis, Alessandro Abate, Stephen Roberts
SafePILCO is a software tool for safe and data-efficient policy search with reinforcement learning. It extends the well-known PILCO algorithm, originally written in MATLAB, to support safe learning. We provide a Python implementation that leverages existing libraries, which keeps the codebase short and modular and makes it suitable for wider use by the verification, reinforcement learning, and control communities.
Aug-7-2020