Rational Inverse Reasoning
Zandonati, Ben, Lozano-Pérez, Tomás, Kaelbling, Leslie Pack
–arXiv.org Artificial Intelligence
Humans can observe a single, imperfect demonstration and immediately generalize to very different problem settings. Robots, in contrast, often require hundreds of examples and still struggle to generalize beyond the training conditions. We argue that this limitation arises from the inability to recover the latent explanations that underpin intelligent behavior, and that these explanations can take the form of structured programs consisting of high-level goals, sub-task decomposition, and execution constraints. In this work, we introduce Rational Inverse Reasoning (RIR), a framework for inferring these latent programs through a hierarchical generative model of behavior. RIR frames few-shot imitation as Bayesian program induction: a vision-language model iteratively proposes structured symbolic task hypotheses, while a planner-in-the-loop inference scheme scores each by the likelihood of the observed demonstration under that hypothesis. This loop yields a posterior over concise, executable programs. We evaluate RIR on a suite of continuous manipulation tasks designed to test one-shot and few-shot generalization across variations in object pose, count, geometry, and layout. With as little as one demonstration, RIR infers the intended task structure and generalizes to novel settings, outperforming state-of-the-art vision-language model baselines.
arXiv.org Artificial Intelligence
Aug-13-2025
- Country:
- North America > United States (0.28)
- Genre:
- Research Report (0.64)
- Technology:
- Information Technology > Artificial Intelligence
- Robots (1.00)
- Representation & Reasoning > Constraint-Based Reasoning (1.00)
- Machine Learning (1.00)
- Cognitive Science (1.00)
- Natural Language > Large Language Model (0.68)
- Information Technology > Artificial Intelligence