UFC-BERT: UnifyingMulti-ModalControlsfor ConditionalImageSynthesis