The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives

Arif, Samee, Arif, Taimoor, Haroon, Muhammad Saad, Khan, Aamina Jamal, Raza, Agha Ali, Athar, Awais

Sep-19-2024–arXiv.org Artificial Intelligence

This paper introduces the concept of an education tool that utilizes Generative Artificial Intelligence (GenAI) to enhance storytelling for children. The system combines GenAI-driven narrative co-creation, text-to-speech conversion, and text-to-video generation to produce an engaging experience for learners. We describe the co-creation process, the adaptation of narratives into spoken words using text-to-speech models, and the transformation of these narratives into contextually relevant visuals through text-to-video technology. Our evaluation covers the linguistics of the generated stories, the text-to-speech conversion quality, and the accuracy of the generated visuals.

evaluation, llama-3, story generation, (16 more...)

arXiv.org Artificial Intelligence

Sep-19-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - Dominican Republic (0.04)
  - United States
    - Michigan (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
- Europe
  - Bulgaria (0.04)
  - Czechia > Prague (0.04)
  - Middle East > Malta
    - Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Asia > Pakistan
  - Punjab > Lahore Division > Lahore (0.04)

Genre:
- Research Report (1.00)

Industry:
- Leisure & Entertainment (1.00)
- Education (0.68)
- Media
  - Music (0.46)
  - Film (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Speech (1.00)
  - Representation & Reasoning > Agents (1.00)
  - Natural Language
    - Large Language Model (1.00)
    - Generation (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning > Generative AI (0.84)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found