Text-to-image models are dated, text-to-video is in now

#artificialintelligence 

In brief AI progresses rapidly. Just months after the release of the most advanced text-to-image models, developers are showing off text-to-video systems. Meta announced a multimodal algorithm named Make-A-Video that allows its users to type a text description of a scene as input and get a short computer-generated animated clip as output, typically depicting what was described. Other types of data, such as an image or a video, can be used as an input prompt, too. The text-to-video system was trained on public datasets, according to a non-peer reviewed paper [PDF] describing the software.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found