Verifiable Reinforcement Learning via Policy Extraction

Open in new window