RGMDT: Return-Gap-MinimizingDecisionTree ExtractioninNon-EuclideanMetricSpace

Feb-19-2026, 04:31:38 GMT–Neural Information Processing Systems

In this paper, we establish an upper bound on the return gap between the oracle expert policy and an optimal decision tree policy. This enables us to recast the DT extraction problem into a novel non-euclidean clustering problem over the local observation and action values space of each agent, with action values as cluster labels and the upper bound on the return gap as clustering loss.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Feb-19-2026, 04:31:38 GMT

Conferences PDF

Add feedback

Country:
- North America > United States
  - Pennsylvania > Allegheny County
    - Pittsburgh (0.04)
  - California > Monterey County
    - Monterey (0.04)
- Europe
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - Finland > Northern Savo
    - Kuopio (0.04)

Genre:
- Research Report (0.67)

Industry:
- Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning > Reinforcement Learning (0.94)

Duplicate Docs Excel Report

Title
21a7b312c42af86b3cd17a26a8ec499e-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found