Reviews: Verifiable Reinforcement Learning via Policy Extraction

Oct-8-2024, 08:11:37 GMT–Neural Information Processing Systems

Post rebuttal Thank the authors for the clarification. One minor point I realised is the equation between line 144 and 145. Is this constraint really a disjunction over partitions? If there is at least one partition the given state doesn't belong to, it would be always true because at least one of inner propositions will be true, wouldn't it? The trained decision tree policy allows for its verification in terms of, more specifically, correctness, stability and robustness.

decision tree policy, policy extraction, verifiable reinforcement learning, (8 more...)

Neural Information Processing Systems

Oct-8-2024, 08:11:37 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.40)