Orthus: Autoregressive Interleaved Image-Text Generation with Modality-Specific Heads

Open in new window