Unsupervised Segmentation in Real-World Images via Spelke Object Inference
Chen, Honglin, Venkatesh, Rahul, Friedman, Yoni, Wu, Jiajun, Tenenbaum, Joshua B., Yamins, Daniel L. K., Bear, Daniel M.
–arXiv.org Artificial Intelligence
Self-supervised, category-agnostic segmentation of real-world images is a challenging open problem in computer vision. Here, we show how to learn static grouping priors from motion self-supervision by building on the cognitive science concept of a Spelke Object: a set of physical stuff that moves together. We introduce the Excitatory-Inhibitory Segment Extraction Network (EISEN), which learns to extract pairwise affinity graphs for static scenes from motion-based training signals. EISEN then produces segments from affinities using a novel graph propagation and competition network. During training, objects that undergo correlated motion (such as robot arms and the objects they move) are decoupled by a bootstrapping process: EISEN explains away the motion of objects it has already learned to segment. We show that EISEN achieves a substantial improvement in the state of the art for self-supervised image segmentation on challenging synthetic and real-world robotics datasets.
arXiv.org Artificial Intelligence
Jul-25-2022
- Country:
- North America > United States
- Pennsylvania (0.04)
- California
- Alameda County > Berkeley (0.04)
- Santa Clara County > Palo Alto (0.04)
- North America > United States
- Genre:
- Research Report (0.64)
- Industry:
- Government > Regional Government (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Robots (1.00)
- Machine Learning
- Statistical Learning (0.68)
- Performance Analysis > Accuracy (0.35)
- Information Technology > Artificial Intelligence