Goto

Collaborating Authors

 Media



SceneScape: Text-Driven Consistent Scene Generation

Neural Information Processing Systems

We present a method for text-driven perpetual view generation - synthesizing long-term videos of various scenes solely from an input text prompt describing the scene and camera poses.



The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation

Neural Information Processing Systems

Extensive experiments focus on quantitative performance against benchmarks, ablations, and the model's ability to capture uncertainty and multimodality, and impute missing values.





Visual Instruction Tuning

Neural Information Processing Systems

Instruction tuning large language models (LLMs) using machine-generated instruction-following data has been shown to improve zero-shot capabilities on new tasks, but the idea is less explored in the multimodal field.


Fans Call on Taylor Swift to 'Do Better' After Accusations of Using AI for Promo Videos

WIRED

Fans Call on Taylor Swift to'Do Better' After Accusations of Using AI for Promo Videos A scavenger hunt campaign to promote Taylor Swift's new album, resulted in a viral #SwiftiesAgainstAI campaign. Fans attend a screening of at a theater in Los Angeles. These were just some of the alleged clues that fans spotted in promo videos for Taylor Swift's new album,, this weekend. They were, to their eyes, telltale indicators that the videos were purportedly made with generative AI . "The first sign that it was AI was that it didn't look great," claims Marcela Lobo, a graphic designer in Brazil who has been a Swift fan since she was 12. "It was wonky, the shadows didn't match, the windows and the painted piano, it looked like shit, basically."