Lumi\`ereNet: Lecture Video Synthesis from Audio
Kim, Byung-Hak, Ganapathi, Varun
We present Lumi\`ereNet, a simple, modular, and completely deep-learning based architecture that synthesizes, high quality, full-pose headshot lecture videos from instructor's new audio narration of any length. Unlike prior works, Lumi\`ereNet is entirely composed of trainable neural network modules to learn mapping functions from the audio to video through (intermediate) estimated pose-based compact and abstract latent codes. Our video demos are available at [22] and [23].
Jul-4-2019
- Country:
- North America > United States
- New York (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Europe
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- United Kingdom > England
- North America > United States
- Genre:
- Research Report (0.64)
- Instructional Material > Course Syllabus & Notes (0.46)
- Industry:
- Technology: