SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data

Neural Information Processing Systems 

Recent text-to-image (T2I) generation models have demonstrated impressive capabilities in creating images from text descriptions.