Goto

Collaborating Authors

 Nagano Prefecture










MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views Y uedong Chen

Neural Information Processing Systems

Diffusion (SVD) model, where these features then act as pose and visual cues to guide the denoising process and produce photorealistic 3D-consistent views. Our model is end-to-end trainable and supports rendering arbitrary views with as few as 5 sparse input views. To evaluate MVSplat360's performance, we introduce a new benchmark using the challenging DL3DV -10K dataset, where