Story Generation from Visual Inputs: Techniques, Related Tasks, and Challenges

Oliveira, Daniel A. P., Ribeiro, Eugénio, de Matos, David Martins

Jun-4-2024–arXiv.org Artificial Intelligence

Creating engaging narratives from visual data is crucial for automated digital media consumption, assistive technologies, and interactive entertainment. This survey covers methodologies used in the generation of these narratives, focusing on their principles, strengths, and limitations. The survey also covers tasks related to automatic story generation, such as image and video captioning, and visual question answering, as well as story generation without visual inputs. These tasks share common challenges with visual story generation and have served as inspiration for the techniques used in the field. We analyze the main datasets and evaluation metrics, providing a critical perspective on their limitations.

corpusid, narrative, semanticscholar, (14 more...)

arXiv.org Artificial Intelligence

Jun-4-2024

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America > United States
  - New York (0.04)
  - Texas (0.04)
  - Pennsylvania > Philadelphia County
    - Philadelphia (0.04)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
  - Michigan > Washtenaw County
    - Ann Arbor (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
  - Louisiana > Orleans Parish
    - New Orleans (0.04)
  - California
    - Los Angeles County > Los Angeles (0.14)
    - San Diego County > San Diego (0.04)
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
    - Oxfordshire > Oxford (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Portugal > Lisbon
    - Lisbon (0.14)
  - Denmark > Capital Region
    - Copenhagen (0.04)

Genre:
- Overview (1.00)

Industry:
- Media (0.92)
- Leisure & Entertainment > Games
  - Computer Games (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Cognitive Science (1.00)
  - Representation & Reasoning
    - Rule-Based Reasoning (0.68)
    - Expert Systems (0.67)
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found