UFC-BERT: Unifying Multi-Modal Controls for Conditional Image Synthesis