Text-to-image models are dated, text-to-video is in now
In brief AI progresses rapidly. Just months after the release of the most advanced text-to-image models, developers are showing off text-to-video systems. Meta announced a multimodal algorithm named Make-A-Video that allows its users to type a text description of a scene as input and get a short computer-generated animated clip as output, typically depicting what was described. Other types of data, such as an image or a video, can be used as an input prompt, too. The text-to-video system was trained on public datasets, according to a non-peer reviewed paper [PDF] describing the software.
Oct-2-2022, 04:20:14 GMT
- Country:
- Europe > United Kingdom (0.16)
- North America > United States (0.32)
- Industry:
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks (0.32)
- Natural Language (0.99)
- Vision (0.73)
- Information Technology > Artificial Intelligence