In search of the next generation of multimodal datasets

Neural Information Processing Systems 

While these advances use different algorithmic techniques, e.g., contrastive learning, diffusion, or auto-regressive modeling, they all rest on a common foundation: large datasets containing paired image-text examples.
