Bootstrapping Clustering of Gaussians for View-consistent 3D Scene Understanding