question
Country:
- North America > United States > Michigan (0.04)
- Asia > China > Guangdong Province > Shenzhen (0.04)
Technology:
Technology:
Industry:
- Health & Medicine > Diagnostic Medicine > Imaging (0.46)
- Health & Medicine > Therapeutic Area > Oncology (0.31)
Technology:
Country:
- Oceania > Australia (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Technology:
Country:
Technology:
RGMDT: Return-Gap-MinimizingDecisionTree ExtractioninNon-EuclideanMetricSpace
In this paper, we establish an upper bound on the return gap between the oracle expert policy and an optimal decision tree policy. This enables us to recast the DT extraction problem into a novel non-euclidean clustering problem over the local observation and action values space of each agent, with action values as cluster labels and the upper bound on the return gap as clustering loss.
Country:
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- North America > United States > California > Monterey County > Monterey (0.04)
- Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
- Europe > Finland > Northern Savo > Kuopio (0.04)
Technology: