Frozen Transformers in Language Models Are Effective Visual Encoder Layers

Open in new window