NewsStories: Illustrating articles with visual summaries
Tan, Reuben, Plummer, Bryan A., Saenko, Kate, Lewis, JP, Sud, Avneesh, Leung, Thomas
–arXiv.org Artificial Intelligence
Recent self-supervised approaches have used large-scale image-text datasets to learn powerful representations that transfer to many tasks without finetuning. These methods often assume that there is one-to-one correspondence between its images and their (short) captions. However, many tasks require reasoning about multiple images and long text narratives, such as describing news articles with visual summaries. Thus, we explore a novel setting where the goal is to learn a self-supervised visual-language representation that is robust to varying text length and the number of images. In addition, unlike prior work which assumed captions have a literal relation to the image, we assume images only contain loose illustrative correspondence with the text. To explore this problem, we introduce a large-scale multimodal dataset containing over 31M articles, 22M images and 1M videos. We show that state-of-the-art image-text alignment methods are not robust to longer narratives with multiple images. Finally, we introduce an intuitive baseline that outperforms these methods on zero-shot image-set retrieval by 10% on the GoodNews dataset.
arXiv.org Artificial Intelligence
Aug-14-2022
- Country:
- Africa > Middle East
- Asia
- China (0.28)
- India (0.04)
- Middle East
- Iraq (0.04)
- Republic of Türkiye
- Ankara Province > Ankara (0.04)
- Istanbul Province > Istanbul (0.04)
- Myanmar (0.04)
- Russia (0.04)
- Europe
- Hungary (0.04)
- Ukraine > Kyiv Oblast
- Kyiv (0.04)
- Middle East
- Cyprus > Nicosia
- Nicosia (0.04)
- Republic of Türkiye > Istanbul Province
- Istanbul (0.04)
- Cyprus > Nicosia
- Greece (0.04)
- Russia (0.04)
- Italy
- United Kingdom > England
- Tyne and Wear > Newcastle (0.04)
- Spain
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Galicia > Madrid (0.04)
- Catalonia > Barcelona Province
- Germany > Berlin (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- North America
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- New Jersey > Hudson County
- Hoboken (0.04)
- Florida > Manatee County
- Bradenton (0.04)
- Massachusetts
- Hampden County > Springfield (0.04)
- Suffolk County > Boston (0.04)
- Colorado (0.04)
- Washington > King County
- Seattle (0.04)
- Virginia (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Oregon (0.04)
- New York (0.04)
- Texas (0.04)
- California > San Diego County
- San Diego (0.04)
- Nevada > Clark County
- Las Vegas (0.04)
- West Virginia (0.04)
- Illinois > Lake County
- Lake Forest (0.04)
- New Jersey > Hudson County
- Mexico > Mexico City
- Oceania > Australia
- South America
- Chile > Santiago Metropolitan Region
- Santiago Province > Santiago (0.04)
- Venezuela > Capital District
- Caracas (0.04)
- Chile > Santiago Metropolitan Region
- Genre:
- Personal (1.00)
- Research Report > New Finding (0.46)
- Industry:
- Leisure & Entertainment > Sports
- Soccer (1.00)
- Media
- Banking & Finance > Trading (1.00)
- Government
- Transportation > Air (1.00)
- Health & Medicine
- Pharmaceuticals & Biotechnology (0.93)
- Therapeutic Area > Immunology (0.94)
- Law
- Civil Rights & Constitutional Law (0.67)
- Litigation (0.93)
- Information Technology (0.67)
- Consumer Products & Services
- Restaurants (0.67)
- Travel (0.68)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.67)
- Leisure & Entertainment > Sports
- Technology: