Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Mogadala, Aditya (Saarland University) | Kalimuthu, Marimuthu (Saarland University) | Klakow, Dietrich (Saarland University)
Journal of Artificial Intelligence Research
Interest in Artificial Intelligence (AI) and its applications has seen unprecedented growth in the last few years. This success can be partly attributed to advances in sub-fields of AI such as machine learning, computer vision, and natural language processing. Much of the growth in these fields has been made possible by deep learning, a sub-area of machine learning that uses artificial neural networks. This has created significant interest in the integration of vision and language. In this survey, we focus on ten prominent tasks that integrate language and vision, discussing their problem formulation, methods, existing datasets, and evaluation measures, and comparing the results obtained with the corresponding state-of-the-art methods. Our efforts go beyond earlier surveys, which are either task-specific or concentrate on only one type of visual content, i.e., image or video. Furthermore, we provide some potential future directions in this field of research, in the anticipation that this survey will stimulate innovative thoughts and ideas to address the existing challenges and build new applications.
Aug-30-2021
- Country:
- Oceania > Australia
- Victoria > Melbourne (0.04)
- Western Australia > Perth (0.04)
- North America
- United States
- Illinois (0.04)
- Texas > Travis County
- Austin (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Nevada > Clark County
- Las Vegas (0.04)
- Colorado > Denver County
- Denver (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- New York
- New York County > New York City (0.14)
- Richmond County > New York City (0.04)
- Queens County > New York City (0.04)
- Kings County > New York City (0.04)
- Bronx County > New York City (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- Washington > King County
- Seattle (0.04)
- California
- Los Angeles County > Long Beach (0.04)
- San Diego County > San Diego (0.04)
- Santa Clara County > Palo Alto (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada
- Quebec > Montreal (0.04)
- Ontario > Toronto (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.14)
- Alberta > Census Division No. 15
- Improvement District No. 9 > Banff (0.04)
- Europe
- Sweden > Stockholm
- Stockholm (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Netherlands > North Holland
- Amsterdam (0.04)
- France > Hauts-de-France
- Germany
- Berlin (0.04)
- Saarland > Saarbrücken (0.04)
- North Rhine-Westphalia > Münster Region
- Münster (0.04)
- Bavaria > Upper Bavaria
- Munich (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Spain > Andalusia
- Granada Province > Granada (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Italy
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- East Asia (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Middle East
- Japan > Honshū
- Kansai > Osaka Prefecture > Osaka (0.04)
- China > Shandong Province
- Qingdao (0.04)
- Africa
- Ethiopia > Addis Ababa
- Addis Ababa (0.04)
- Central African Republic > Ombella-M'Poko
- Bimbo (0.04)
- Genre:
- Research Report > Promising Solution (1.00)
- Overview (1.00)
- Industry:
- Media > Film (1.00)
- Leisure & Entertainment > Sports (1.00)
- Information Technology (0.92)
- Education > Curriculum
- Subject-Specific Education (0.34)
- Technology: