Safety-Oriented Pruning and Interpretation of Reinforcement Learning Policies

Sep-16-2024–arXiv.org Artificial Intelligence

Pruning neural networks (NNs) can streamline them but risks removing vital parameters from safe reinforcement learning (RL) policies. We introduce an interpretable RL method called VERINTER, which combines NN pruning with model checking to ensure interpretable RL safety. VERINTER exactly quantifies the effects of pruning and the impact of neural connections on complex safety properties by analyzing changes in safety measurements. This method maintains safety in pruned RL policies and enhances understanding of their safety dynamics, which has proven effective in multiple RL settings.

pruning, reinforcement, safety, (12 more...)

arXiv.org Artificial Intelligence

Sep-16-2024

arXiv.org PDF

Add feedback

Country:
- Europe > Norway > Eastern Norway > Oslo (0.04)

Genre:
- Research Report (0.64)
- Overview (0.47)

Industry:
- Transportation
  - Passenger (0.49)
  - Ground > Road (0.48)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Neural Networks (0.89)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found