Human-Inspired Topological Representations for Visual Object Recognition in Unseen Environments
Samani, Ekta U., Banerjee, Ashis G.
–arXiv.org Artificial Intelligence
Visual object recognition in unseen and cluttered indoor environments is a challenging problem for mobile robots. Toward this goal, we extend our previous work to propose the TOPS2 descriptor, and an accompanying recognition framework, THOR2, inspired by a human reasoning mechanism known as object unity. We interleave color embeddings obtained using the Mapper algorithm for topological soft clustering with the shape-based TOPS descriptor to obtain the TOPS2 descriptor. THOR2, trained using synthetic data, achieves substantially higher recognition accuracy than the shape-based THOR framework and outperforms RGB-D ViT on two real-world datasets: the benchmark OCID dataset and the UW-IS Occluded dataset. Therefore, THOR2 is a promising step toward achieving robust recognition in low-cost robots.
arXiv.org Artificial Intelligence
Sep-15-2023
- Country:
- Asia
- Japan (0.04)
- Middle East > Iran
- Tehran Province > Tehran (0.04)
- Europe > United Kingdom (0.04)
- North America > United States
- Washington > King County > Seattle (0.14)
- Oceania > Australia (0.04)
- Asia
- Genre:
- Research Report (0.40)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (0.46)
- Robots (1.00)
- Vision (1.00)
- Machine Learning > Neural Networks
- Information Technology > Artificial Intelligence