Automatic Testing and Falsification with Dynamically Constrained Reinforcement Learning

Qin, Xin, Aréchiga, Nikos, Best, Andrew, Deshmukh, Jyotirmoy

arXiv.org Artificial Intelligence 

Automatic T esting and Falsification with Dynamically Constrained Reinforcement Learning Xin Qin 1, Nikos Ar echiga 2, Andrew Best 2, Jyotirmoy Deshmukh 1 Abstract -- We consider the problem of using reinforcement learning to train adversarial agents for automatic testing and falsification of cyberphysical systems, such as autonomous vehicles, robots, and airplanes. In order to produce useful agents, however, it is useful to be able to control the degree of adversariality by specifying rules that an agent must follow. For example, when testing an autonomous vehicle, it is useful to find maximally antagonistic traffic participants that obey traffic rules. We model dynamic constraints as hierarchically ordered rules expressed in Signal T emporal Logic, and show how these can be incorporated into an agent training process. We prove that our agent-centric approach is able to find all dangerous behaviors that can be found by traditional falsification techniques while producing modular and reusable agents. We demonstrate our approach on two case studies from the automotive domain. I NTRODUCTION When developing cyberphysical systems such as autonomous vehicles, drones, or aircraft, it is important to have a robust testing strategy that finds critical bugs before the system is put into production. Falsification techniques exist to find simulations in which the system under test fails to satisfy its target specification. These falsification traces can be generated from a bounded set of inputs.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found