SSEditor: Controllable Mask-to-Scene Generation with Diffusion Model