Image Parsing via Stochastic Scene Grammar

Neural Information Processing Systems 

This paper proposes a parsing algorithm for scene understanding which includes four aspects: computing 3D scene layout, detecting 3D objects (e.g.