Compositional Image Synthesis with Inference-Time Scaling