Explainable RL Policies by Distilling to Locally-Specialized Linear Policies with Voronoi State Partitioning
Deproost, Senne, Steckelmacher, Dennis, Nowé, Ann
–arXiv.org Artificial Intelligence
Deep Reinforcement Learning is one of the state-of-the-art methods for producing near-optimal system controllers. However, deep RL algorithms train a deep neural network, that lacks transparency, which poses challenges when the controller has to meet regulations, or foster trust. To alleviate this, one could transfer the learned behaviour into a model that is human-readable by design using knowledge distilla- tion. Often this is done with a single model which mimics the original model on average but could struggle in more dynamic situations. A key challenge is that this simpler model should have the right balance be- tween flexibility and complexity or right balance between balance bias and accuracy. We propose a new model-agnostic method to divide the state space into regions where a simplified, human-understandable model can operate in. In this paper, we use Voronoi partitioning to find regions where linear models can achieve similar performance to the original con- troller. We evaluate our approach on a gridworld environment and a classic control task. We observe that our proposed distillation to locally- specialized linear models produces policies that are explainable and show that the distillation matches or even slightly outperforms the black-box policy they are distilled from.
arXiv.org Artificial Intelligence
Nov-18-2025
- Country:
- Europe
- Belgium
- Brussels-Capital Region > Brussels (0.04)
- Flanders (0.04)
- Middle East > Cyprus
- Switzerland (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Belgium
- North America
- Canada > British Columbia
- United States
- California
- San Diego County > San Diego (0.04)
- San Francisco County > San Francisco (0.14)
- Florida > Sarasota County
- Sarasota (0.04)
- Indiana (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- New York > New York County
- New York City (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- California
- Europe
- Genre:
- Research Report > Promising Solution (0.54)
- Technology: