Now Microsoft wants a share of the 'AI image generator' pie
Text-to-image generative models like OpenAI's DALL-E 2 are attracting significant attention because they can produce images from nothing more than a text prompt. While DALL-E 2 is the most popular, there are other budding AI image generators, such as 'Midjourney' (from the independent research lab of the same name), 'Craiyon' (hosted on Hugging Face), Meta's 'Make-A-Scene' and Google's 'Imagen'. Now, it seems that Microsoft also wants a share of the 'AI image generator' pie. Recently, Microsoft Research Asia introduced NUWA-Infinity, a multimodal generative model designed to generate high-quality images and videos from a given text, image or video input. In the research paper, titled 'NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis', the researchers report evaluating NUWA-Infinity on five high-resolution visual synthesis tasks. Compared to its predecessor 'NUWA', which also covers images and videos, NUWA-Infinity has superior visual synthesis capabilities in terms of resolution and variable-size generation.
Aug-5-2022, 21:05:14 GMT