Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly

Neural Information Processing Systems 

Large language and vision models have been leading a revolution in visual computing. By greatly scaling up sizes of data and model parameters, the large models learn deep priors which lead to remarkable performance in various tasks.