Learning Interpretable Classifiers for PDDL Planning
–arXiv.org Artificial Intelligence
We consider the problem of synthesizing interpretable models that recognize the behaviour of an agent compared to other agents, on a whole set of similar planning tasks expressed in PDDL. Our approach consists in learning logical formulas, from a small set of examples that show how an agent solved small planning instances. These formulas are expressed in a version of First-Order Temporal Logic (FTL) tailored to our planning formalism. Such formulas are human-readable, serve as (partial) descriptions of an agent's policy, and generalize to unseen instances. We show that learning such formulas is computationally intractable, as it is an NP-hard problem. As such, we propose to learn these behaviour classifiers through a topology-guided compilation to MaxSAT, which allows us to generate a wide range of different formulas. Experiments show that interesting and accurate formulas can be learned in reasonable time.
arXiv.org Artificial Intelligence
Oct-13-2024
- Country:
- Oceania > Australia (0.04)
- North America > United States
- New York > New York County
- New York City (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- New York > New York County
- Europe > France
- Occitanie > Haute-Garonne > Toulouse (0.04)
- Genre:
- Research Report (0.40)
- Technology: