Classical Planning in Deep Latent Space
Asai, Masataro, Kajino, Hiroshi, Fukunaga, Alex, Muise, Christian
–arXiv.org Artificial Intelligence
Current domain-independent, classical planners require symbolic models of the problem domain and instance as input, resulting in a knowledge acquisition bottleneck. Meanwhile, although deep learning has achieved significant success in many fields, the knowledge is encoded in a subsymbolic representation which is incompatible with symbolic systems such as planners. We propose Latplan, an unsupervised architecture combining deep learning and classical planning. Given only an unlabeled set of image pairs showing a subset of transitions allowed in the environment (training inputs), Latplan learns a complete propositional PDDL action model of the environment. Later, when a pair of images representing the initial and the goal states (planning inputs) is given, Latplan finds a plan to the goal state in a symbolic latent space and returns a visualized plan execution. We evaluate Latplan using image-based versions of 6 planning domains: 8-puzzle, 15-Puzzle, Blocksworld, Sokoban and Two variations of LightsOut.
arXiv.org Artificial Intelligence
Jun-30-2021
- Country:
- North America
- United States > Massachusetts
- Middlesex County > Belmont (0.04)
- Canada > Ontario
- Kingston (0.04)
- United States > Massachusetts
- Europe
- France (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Germany > Brandenburg
- Potsdam (0.04)
- Asia
- North America
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Leisure & Entertainment > Games (1.00)
- Technology:
- Information Technology
- Sensing and Signal Processing > Image Processing (1.00)
- Knowledge Management > Knowledge Engineering (1.00)
- Artificial Intelligence
- Robots (1.00)
- Natural Language (1.00)
- Cognitive Science > Problem Solving (1.00)
- Representation & Reasoning
- Search (1.00)
- Planning & Scheduling (1.00)
- Expert Systems (1.00)
- Uncertainty > Bayesian Inference (0.92)
- Machine Learning
- Statistical Learning (1.00)
- Neural Networks > Deep Learning (1.00)
- Reinforcement Learning (0.92)
- Learning Graphical Models > Directed Networks
- Bayesian Learning (0.92)
- Information Technology