TutoAI: A Cross-domain Framework for AI-assisted Mixed-media Tutorial Creation on Physical Tasks

Chen, Yuexi, Morariu, Vlad I., Truong, Anh, Liu, Zhicheng

Mar-12-2024–arXiv.org Artificial Intelligence

Mixed-media tutorials, which integrate videos, images, text, and diagrams to teach procedural skills, offer more browsable alternatives than timeline-based videos. However, manually creating such tutorials is tedious, and existing automated solutions are often restricted to a particular domain. While AI models hold promise, it is unclear how to effectively harness their powers, given the multi-modal data involved and the vast landscape of models. We present TutoAI, a cross-domain framework for AI-assisted mixed-media tutorial creation on physical tasks. First, we distill common tutorial components by surveying existing work; then, we present an approach to identify, assemble, and evaluate AI models for component extraction; finally, we propose guidelines for designing user interfaces (UI) that support tutorial creation based on AI-generated components. We show that TutoAI has achieved higher or similar quality compared to a baseline model in preliminary user studies.

mixed-media tutorial, tutorial, video, (15 more...)

arXiv.org Artificial Intelligence

Mar-12-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Colorado (0.04)
  - New York > New York County
    - New York City (0.04)
  - Maryland > Prince George's County
    - College Park (0.04)
  - Hawaii > Honolulu County
    - Honolulu (0.05)
  - California > San Francisco County
    - San Francisco (0.14)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Workflow (1.00)
- Research Report (1.00)
- Instructional Material > Course Syllabus & Notes (1.00)
- Questionnaire & Opinion Survey (0.86)

Industry:
- Health & Medicine (0.69)
- Education
  - Educational Technology (0.97)
  - Educational Setting > Online (0.93)

Technology:
- Information Technology
  - Human Computer Interaction > Interfaces (1.00)
  - Communications > Social Media (1.00)
  - Artificial Intelligence
    - Vision (1.00)
    - Natural Language > Large Language Model (1.00)
    - Representation & Reasoning (0.93)
    - Machine Learning > Neural Networks (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found