Semantic Mapping in Indoor Embodied AI -- A Comprehensive Survey and Future Directions
Sonia Raychaudhuri, Angel X. Chang
arXiv.org Artificial Intelligence
Among the many skills that embodied agents need to possess, building and maintaining a semantic map of the environment is especially crucial for long-horizon tasks. A semantic map captures information about the environment in a structured way, allowing the agent to reference it for advanced reasoning throughout the task. While existing surveys in embodied AI focus on general advancements or on specific tasks such as navigation and manipulation, this paper provides a comprehensive review of semantic map-building approaches in embodied AI, specifically for indoor navigation. We categorize these approaches by their structural representation (spatial grids, topological graphs, dense point clouds, or hybrid maps) and by the type of information they encode (implicit features or explicit environmental data). We also explore the strengths and limitations of the map-building techniques, highlight current challenges, and propose future research directions. We find that the field is moving towards open-vocabulary, queryable, task-agnostic map representations, while high memory demands and computational inefficiency remain open challenges. This survey aims to guide current and future researchers in advancing semantic mapping techniques for embodied AI systems.
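The two axes of the categorization above (map structure, and implicit versus explicit content) can be illustrated with a small sketch. This is not taken from the survey: the class names `GridMap` and `TopoMap`, their methods, and the toy two-dimensional feature vectors are all hypothetical, chosen only to contrast a spatial grid that stores explicit labels with a topological graph that stores implicit features and is queried by similarity.

```python
import math
from dataclasses import dataclass, field


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


@dataclass
class GridMap:
    """Hypothetical spatial grid: each observed cell holds an explicit class label."""
    width: int
    height: int
    cells: dict = field(default_factory=dict)  # (x, y) -> label string

    def update(self, x, y, label):
        if not (0 <= x < self.width and 0 <= y < self.height):
            raise ValueError("cell outside the grid")
        self.cells[(x, y)] = label

    def find(self, label):
        """Return the sorted cells currently labelled with `label`."""
        return sorted(cell for cell, l in self.cells.items() if l == label)


@dataclass
class TopoMap:
    """Hypothetical topological graph: each node holds an implicit feature vector."""
    features: dict = field(default_factory=dict)  # node name -> feature vector
    edges: dict = field(default_factory=dict)     # node name -> set of neighbours

    def add_node(self, name, feature):
        self.features[name] = list(feature)
        self.edges.setdefault(name, set())

    def connect(self, a, b):
        self.edges[a].add(b)
        self.edges[b].add(a)

    def most_similar(self, query):
        """Return the node whose feature is closest to `query` by cosine similarity."""
        return max(self.features, key=lambda n: cosine(self.features[n], query))


# Explicit grid map: look up where a known class was observed.
grid = GridMap(width=4, height=4)
grid.update(1, 2, "chair")
grid.update(3, 0, "chair")
grid.update(0, 0, "table")
chair_cells = grid.find("chair")

# Implicit topological map: query with a feature vector instead of a label.
topo = TopoMap()
topo.add_node("kitchen", [1.0, 0.0])
topo.add_node("bedroom", [0.0, 1.0])
topo.connect("kitchen", "bedroom")
best_node = topo.most_similar([0.9, 0.1])
```

In this toy setting the grid answers label queries exactly, while the graph can only be queried through its features. That difference is a simplified view of why queryable, open-vocabulary maps tend to store features rather than a fixed label set.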
Jan-10-2025
- Country:
  - North America > United States (0.28)
- Genre:
  - Overview (1.00)
- Technology:
  - Information Technology
    - Artificial Intelligence
      - Cognitive Science (1.00)
      - Machine Learning > Neural Networks
        - Deep Learning (0.93)
      - Natural Language
        - Large Language Model (0.93)
        - Text Processing (1.00)
      - Representation & Reasoning > Agents (1.00)
      - Robots (1.00)
      - Vision (1.00)
    - Sensing and Signal Processing > Image Processing (0.68)