Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions
–Neural Information Processing Systems
Image description datasets play a crucial role in the advancement of various applications such as image understanding, text-to-image generation, and text-image retrieval. Currently, image description datasets primarily originate from two sources. One source is the scraping of image-text pairs from the web. Despite their abundance, these descriptions are often of low quality and noisy. Another way is through human labeling.
Neural Information Processing Systems
Dec-27-2025, 05:54:31 GMT
- Technology: