BREEN: Bridge Data-Efficient Encoder-Free Multimodal Learning with Learnable Queries

Open in new window