Generative AI
Let's Force an AI to Design Us Some Cars
Khyzyl Saleem is an incredible automotive designer, responsible for some of the most out-there looks ever to sit on four wheels. He also has a fairly distinctive style, combining wide bodies with slammed suspensions, track-day aero, neon lighting, and lots of vents and intakes. How will DALL-E apply that aesthetic to the humble 944? DALL-E took a long time to ponder on this one, but it was worth every second. I swear I've seen that barcode-looking license plate before on a particularly cyberpunk Khyzyl design, though I can't quite recall which one.
Artificial General Intelligence Is Not as Imminent as You Might Think
To the average person, it must seem as if the field of artificial intelligence is making immense progress. According to the press releases, and some of the more gushing media accounts, OpenAI's DALL-E 2 can seemingly create spectacular images from any text; another OpenAI system called GPT-3 can talk about just about anything; and a system called Gato that was released in May by DeepMind, a division of Alphabet, seemingly worked well on every task the company could throw at it. One of DeepMind's high-level executives even went so far as to brag that in the quest for artificial general intelligence (AGI), AI that has the flexibility and resourcefulness of human intelligence, "The Game is Over!" And Elon Musk said recently that he would be surprised if we didn't have artificial general intelligence by 2029. Machines may someday be as smart as people, and perhaps even smarter, but the game is far from over.
A good alternative to DALL·E 2 that you can use while waiting
DALLE-2 can produce photorealistic graphics from natural language descriptions. While such models are adaptable, they fail to comprehend some notions, such as object relationships. This article proposes a newest AI art generating method. A alternative diffusion model can generate images conditioned on your sentence descriptions.
Deepmind: Is "Gato" a precursor for general artificial intelligence?
Deepmind's Gato solves many tasks, but none of them really well. Does the new AI system nevertheless lead the way for general artificial intelligence? Hot on the heels of OpenAI's DALL-E 2, Google's PaLM, LaMDA 2, and Deepmind's Chinchilla and Flamingo, the London-based AI company is showing off another large AI model that outperforms existing systems. Yet Deepmind's Gato is different: The model can't text better, describe images better, play Atari better, control robotic arms better, or orient itself in 3D spaces better than other AI systems. But Gato can do a bit of everything. Deepmind trained the Transformer-based multi-talent with images, text, proprioception, joint moments, keystrokes, and other discrete and continuous observations and actions.
Artificial intelligence text-to-image tool may use its own 'secret language', experts claim
An artificial intelligence (AI) tool that can transform famous paintings into different art styles, or create brand new artworks from a text prompt, may work by using a'secret language', experts claim. Text-to-image app DALL-E 2 was released by artificial intelligence lab OpenAI last month, and is able to create multiple realistic images and artwork from a single text prompt. It is also able to add objects into existing images, or even provide different points of view on an existing image. Now researchers believe they may have figured out how the technology works, after discovering that gibberish words produce specific pictures. Computer scientists used DALL-E 2 to generate images that contained text inside them, by asking for'captions' or'subtitles'.
OpenAI's DALL·E 2 doesn't understand some secret language
In brief AI text-to-image generation models are all the rage right now. You give them a simple description of a scene, such as "a vulture typing on a laptop," and they come up with an illustration that resembles that description. But developers who have special access to OpenAI's text-to-image engine DALL·E 2 have found all sorts of weird behaviors – including what may be a hidden, made-up language. Giannis Daras, a PhD student at the University of Texas at Austin shared artwork produced by DALL·E 2 given the input: "Apoploe vesrreaitais eating Contarra ccetnxniams luryca tanniounons" – a phrase that makes no sense to humans. But to the machine, it seemed to generate images of birds eating bugs consistently.
Do AI systems really have their own secret language?
A new generation of artificial intelligence (AI) models can produce "creative" images on-demand based on a text prompt. While the output of these models is often striking, it's hard to know exactly how they produce their results. Last week, researchers in the US made the intriguing claim that the DALL-E 2 model might have invented its own secret language to talk about objects. By prompting DALL-E 2 to create images containing text captions, then feeding the resulting (gibberish) captions back into the system, the researchers concluded DALL-E 2 thinks Vicootes means "vegetables", while Wa ch zod rea refers to "sea creatures that a whale might eat". These claims are fascinating, and if true, could have important security and interpretability implications for this kind of large AI model.
Why the AGI discussion is getting heated again
And right now, we are in the midst of one of those cycles. Tech entrepreneurs are warning about the alien invasion of AGI. The media is awash with reports of AI systems that are mastering language and moving toward generalization. And social media is filled with heated discussions about deep neural networks and consciousness. Recent years have seen some truly impressive advances in AI, and scientists have been able to make progress in some of the most challenging areas of the field.
How to get Codex to produce the code you want!
Have you seen AI models that can generate code for you? Well, if you haven't, you're going to see them a lot more soon thanks to models like OpenAI's Codex models. Codex is a family of AI models from Open AI that translates between natural language and code in more than a dozen programming languages. The power of these AI models is that you can quickly develop and iterate on your ideas and build products that help people do more. Here is an example how you can have a conversation with a Minecraft character and have it follow your instructions by generating Minecraft API commands behind the scenes. This article will show you how to get models like Codex to generate code you want using a technique called Prompt Engineering.