Multi-modal Auto-regressive Modeling via Visual Words

Open in new window