FlowBotHD: History-Aware Diffuser Handling Ambiguities in Articulated Objects Manipulation
Li, Yishu, Leng, Wen Hui, Fang, Yiming, Eisner, Ben, Held, David
–arXiv.org Artificial Intelligence
We introduce a novel approach for manipulating articulated objects which are visually ambiguous, such doors which are symmetric or which are heavily occluded. These ambiguities can cause uncertainty over different possible articulation modes: for instance, when the articulation direction (e.g. push, pull, slide) or location (e.g. left side, right side) of a fully closed door are uncertain, or when distinguishing features like the plane of the door are occluded due to the viewing angle. To tackle these challenges, we propose a history-aware diffusion network that can model multi-modal distributions over articulation modes for articulated objects; our method further uses observation history to distinguish between modes and make stable predictions under occlusions. Experiments and analysis demonstrate that our method achieves state-of-art performance on articulated object manipulation and dramatically improves performance for articulated objects containing visual ambiguities. Our project website is available at https://flowbothd.github.io/.
arXiv.org Artificial Intelligence
Dec-28-2024
- Country:
- North America > United States (0.46)
- Genre:
- Research Report (1.00)
- Workflow (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (1.00)
- Representation & Reasoning (0.68)
- Robots (1.00)
- Vision (0.68)
- Information Technology > Artificial Intelligence