Learning Visual Prior via Generative Pre-Training

Neural Information Processing Systems 

Various stuff and things in visual data possess specific traits, which can be learned by deep neural networks and are implicitly represented as the visual prior, e.g., object location and shape, in the model.