Google's image generator rivals DALL-E in shiba inu drawing – TechCrunch
The AI world is still figuring out how to deal with the amazing show of prowess that is DALL-E 2's ability to draw/paint/imagine just about anything… but OpenAI isn't the only one working on something like that. Google Research has rushed to publicize a similar model it's been working on -- which it claims is even better. Imagen (get it?) is a text-to-image diffusion-based generator built on large transformer language models that… okay, let's slow down and unpack that real quick. Text-to-image models take text inputs like "a dog on a bike" and produce a corresponding image, something that has been done for years but recently has seen huge jumps in quality and accessibility. Part of that is using diffusion techniques, which basically start with a pure noise image and slowly refine it bit by bit until the model thinks it can't make it look any more like a dog on a bike than it already does.
May-24-2022, 00:35:09 GMT
- Technology: