Remix-DiT: Mixing Diffusion Transformers for Multi-Expert Denoising Gongfan Fang Xinyin Ma Xinchao Wang National University of Singapore

Neural Information Processing Systems 

The goal of multi-expert denoising is to increase the overall capacity of diffusion models while keeping an acceptable overhead.