The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives
Arif, Samee; Arif, Taimoor; Haroon, Muhammad Saad; Khan, Aamina Jamal; Raza, Agha Ali; Athar, Awais
arXiv.org Artificial Intelligence
This paper introduces the concept of an education tool that utilizes Generative Artificial Intelligence (GenAI) to enhance storytelling for children. The system combines GenAI-driven narrative co-creation, text-to-speech conversion, and text-to-video generation to produce an engaging experience for learners. We describe the co-creation process, the adaptation of narratives into spoken words using text-to-speech models, and the transformation of these narratives into contextually relevant visuals through text-to-video technology. Our evaluation covers the linguistics of the generated stories, the text-to-speech conversion quality, and the accuracy of the generated visuals.
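The three stages named in the abstract (narrative co-creation, text-to-speech, text-to-video) can be pictured as a simple pipeline. The following is a minimal sketch under stated assumptions, not the authors' implementation: every name in it (`Scene`, `co_create`, `stub_tts`, `stub_video`, `build_story`) is hypothetical, and the stubs stand in for the generative models, which the abstract does not specify.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Scene:
    text: str    # narrative text for one scene
    audio: bytes # narration from the text-to-speech stage
    video: str   # identifier of the clip from the text-to-video stage


def co_create(prompt: str, turns: List[str]) -> List[str]:
    """Hypothetical co-creation step: the opening prompt plus one
    scene per learner contribution. A real system would call a
    language model here; this stub only concatenates the inputs."""
    return [prompt] + list(turns)


def stub_tts(text: str) -> bytes:
    """Placeholder for a text-to-speech model (assumption)."""
    return text.encode("utf-8")


def stub_video(text: str) -> str:
    """Placeholder for a text-to-video model (assumption)."""
    return "clip:" + text[:20]


def build_story(
    prompt: str,
    turns: List[str],
    tts: Callable[[str], bytes] = stub_tts,
    video: Callable[[str], str] = stub_video,
) -> List[Scene]:
    """Run each co-created scene through the speech and video stages."""
    return [Scene(t, tts(t), video(t)) for t in co_create(prompt, turns)]


scenes = build_story("A fox finds a lantern.", ["The lantern starts to glow."])
for scene in scenes:
    print(scene.text, len(scene.audio), scene.video)
```

Passing the speech and video stages as callables keeps the sketch independent of any particular model; real components could be swapped in without changing `build_story`.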
Sep-19-2024
- Country:
  - Asia > Pakistan
    - Punjab > Lahore Division > Lahore (0.04)
  - Europe
    - Bulgaria (0.04)
    - Czechia > Prague (0.04)
    - Middle East > Malta
      - Eastern Region > Northern Harbour District > St. Julian's (0.04)
  - North America
    - Dominican Republic (0.04)
    - United States
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Michigan (0.04)
- Genre:
  - Research Report (1.00)
- Industry:
  - Education (0.68)
  - Leisure & Entertainment (1.00)
  - Media
- Technology:
  - Information Technology > Artificial Intelligence
    - Machine Learning > Neural Networks
      - Deep Learning > Generative AI (0.84)
    - Natural Language
      - Generation (1.00)
      - Large Language Model (1.00)
    - Representation & Reasoning > Agents (1.00)
    - Speech (1.00)