Multimodal Data Storage and Retrieval for Embodied AI: A Survey
–arXiv.org Artificial Intelligence
Embodied AI (EAI) agents continuously interact with the physical world, generating vast, heterogeneous multimodal data streams that traditional management systems are ill-equipped to handle. In this survey, we first systematically evaluate five storage architectures (Graph Databases, Multi-Model Databases, Data Lakes, Vector Databases, and Time-Series Databases), focusing on their suitability for addressing EAI's core requirements, including physical grounding, low-latency access, and dynamic scalability. We then analyze five retrieval paradigms (Fusion Strategy-Based Retrieval, Representation Alignment-Based Retrieval, Graph-Structure-Based Retrieval, Generation Model-Based Retrieval, and Efficient Retrieval-Based Optimization), revealing a fundamental tension between achieving long-term semantic coherence and maintaining real-time responsiveness. Based on this comprehensive analysis, we identify key bottlenecks, spanning from the foundational Physical Grounding Gap to systemic challenges in cross-modal integration, dynamic adaptation, and open-world generalization. Finally, we outline a forward-looking research agenda encompassing physics-aware data models, adaptive storage-retrieval co-optimization, and standardized benchmarking, to guide future research toward principled data management solutions for EAI. Our survey is based on a comprehensive review of more than 180 related studies, providing a rigorous roadmap for designing the robust, high-performance data management frameworks essential for the next generation of autonomous embodied systems.
arXiv.org Artificial Intelligence
Aug-20-2025
- Country:
- Asia
- China
- Beijing > Beijing (0.04)
- Guangdong Province > Guangzhou (0.04)
- Middle East > Jordan (0.04)
- China
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- Asia
- Genre:
- Overview (1.00)
- Industry:
- Education (0.67)
- Health & Medicine > Therapeutic Area
- Neurology (0.46)
- Information Technology (0.93)
- Technology:
- Information Technology
- Architecture > Real Time Systems (1.00)
- Artificial Intelligence
- Cognitive Science > Problem Solving (1.00)
- Machine Learning
- Neural Networks > Deep Learning (0.46)
- Statistical Learning (0.67)
- Natural Language
- Information Retrieval > Query Processing (0.67)
- Large Language Model (0.93)
- Representation & Reasoning > Agents (1.00)
- Robots (1.00)
- Vision (1.00)
- Data Science > Data Mining (1.00)
- Information Management (1.00)
- Sensing and Signal Processing > Image Processing (0.93)
- Information Technology