Lumi\`ereNet: Lecture Video Synthesis from Audio

Jul-4-2019–arXiv.org Machine Learning

We present Lumi\`ereNet, a simple, modular, and completely deep-learning based architecture that synthesizes, high quality, full-pose headshot lecture videos from instructor's new audio narration of any length. Unlike prior works, Lumi\`ereNet is entirely composed of trainable neural network modules to learn mapping functions from the audio to video through (intermediate) estimated pose-based compact and abstract latent codes. Our video demos are available at [22] and [23].

artificial intelligence, machine learning, video, (18 more...)

arXiv.org Machine Learning

Jul-4-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - New York (0.04)
  - Georgia > Fulton County
    - Atlanta (0.04)
- Europe
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)

Genre:
- Research Report (0.64)
- Instructional Material > Course Syllabus & Notes (0.46)

Industry:
- Education
  - Educational Setting > Online (1.00)
  - Educational Technology > Educational Software
    - Computer Based Training (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found