GSplatVNM: Point-of-View Synthesis for Visual Navigation Models Using Gaussian Splatting
Honda, Kohei, Ishita, Takeshi, Yoshimura, Yasuhiro, Yonetani, Ryo
–arXiv.org Artificial Intelligence
This paper presents a novel approach to image-goal navigation by integrating 3D Gaussian Splatting (3DGS) with Visual Navigation Models (VNMs), a method we refer to as GSplatVNM. VNMs offer a promising paradigm for image-goal navigation by guiding a robot through a sequence of point-of-view images without requiring metrical localization or environment-specific training. However, constructing a dense and traversable sequence of target viewpoints from start to goal remains a central challenge, particularly when the available image database is sparse. To address these challenges, we propose a 3DGS-based viewpoint synthesis framework for VNMs that synthesizes intermediate viewpoints to seamlessly bridge gaps in sparse data while significantly reducing storage overhead. Experimental results in a photorealistic simulator demonstrate that our approach not only enhances navigation efficiency but also exhibits robustness under varying levels of image database sparsity.
arXiv.org Artificial Intelligence
Mar-10-2025
- Genre:
- Research Report > Promising Solution (0.48)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Robots (1.00)
- Representation & Reasoning (1.00)
- Machine Learning (1.00)
- Information Technology > Artificial Intelligence