Generative AI
Meta unveils artificial intelligence-generated video
Meta announced that it was taking artificial intelligence-generated art to the next level by allowing users to create short video clips by just typing in a string of descriptive statements. Meta's AI division announced Thursday that it was unveiling Make-a-Video, an AI system that allows users to turn text prompts into short video clips of whatever was described. "Generative AI research is pushing creative expression forward by giving people tools to quickly and easily create new content," Meta said in a post describing the new technology. "With just a few words or lines of text, Make-A-Video can bring imagination to life and create one-of-a-kind videos full of vivid colors, characters, and landscapes." We're pleased to introduce Make-A-Video, our latest in #GenerativeAI research!
Stability AI is the New OpenAI
Stability AI, the company behind Stable Diffusion, is trying to democratise AI and use its potential to'awaken humanity's potential'. However, if you have closely followed the AI ecosystem over the past few years, this might give you a sense of déjà vu. In 2015, Sam Altman and Elon Musk announced the formation of OpenAI, a non-profit company with similar motives – to democratise AI and open its research and models to the public. However, DALL-E2, by OpenAI, is not open-source; the company charges its users. Hence, with Stability AI in the picture, it made many of us wonder if Stability AI will toe the OpenAI line.
Anyone can now use powerful AI tools to make images. What could possibly go wrong?
If you've ever wanted to use artificial intelligence to quickly design a hybrid between a duck and a corgi, now is your time to shine. On Wednesday, OpenAI announced that anyone can now use the most recent version of its AI-powered DALL-E tool to generate a seemingly limitless range of images just by typing in a few words, months after the startup began gradually rolling it out to users. The move will likely expand the reach of a new crop of AI-powered tools that have already attracted a wide audience and challenged our fundamental ideas of art and creativity. But it could also add to concerns about how such systems could be misused when widely available. "Learning from real-world use has allowed us to improve our safety systems, making wider availability possible today," OpenAI said in a blog post.
Want to Check Your Tech Creativity? Well, Dall-E.2 is Waiting for You All!
Users can now sign up and start using AI Image Generator without waiting. Dall-E.2 is a machine learning model that can generate images based on a test description in natural language. The company revealed the original Dall-E in January 2021, with the tool impressing both AI experts and the public with its ability to turn any text description or prompt into a unique image. Dall-E is a transformer language model that has been trained on a large number of images and accompanying captions. This permits it to generate original images from scratch using just text descriptions.
Working With AI – The Passive Voice
In August, first prize in the digital-art category of the Colorado State Fair's fine-art competition went to a man who used artificial intelligence (AI) to generate his submission, "Théâtre d'Opéra Spatial." He supplied the AI, a program called Midjourney, with only a "prompt"--a textual description of what he wanted. Systems like Midjourney and the similar DALL-E 2 have led to a new role in our AI age: "prompt engineer." Such people can even sell their textual wares in an online market called PromptBase. Midjourney and DALL-E 2 emerged too late to be included in "Working With AI: Real Stories of Human-Machine Collaboration," by Thomas Davenport and Steven Miller, information-systems professors at Babson College and Singapore Management University, respectively.
Meta's new text-to-video AI generator is like DALL-E for video
The researchers note in the paper that the model has many technical limitations beyond blurry footage and disjointed animation. For example, their training methods are unable to learn information that might only be inferred by a human watching a video -- e.g., whether a video of a waving hand is going left to right or right to left. Other problems include generating videos longer than five seconds, videos with multiple scenes and events, and higher resolution. Make-A-Video currently outputs 16 frames of video at a resolution of 64 by 64 pixels, which are then boosted in size using a separate AI model to 768 by 768.
DALL-E is now available to all. NPR put it to work
"A Cubist painting of a mug with the NPR logo on it, on a table next to a old-timey radio" Image generated by DALL-E/OpenAI hide caption An artificial intelligence tool called DALL-E that's stunned with its ability to render text into realistic images is now available to the public. OpenAI, the Silicon Valley research lab behind the program, announced Wednesday it has dropped the waitlist to use the program. Until now, OpenAI released the tool to a select group of users that included academics, artists and journalists. The iterative rollout was designed to curb the potential for bad actors to leverage the tool for disinformation and other harmful uses. The excitement over the invite-only tool had meanwhile inspired an imitation known as DALL-E mini, a limited model in comparison that's not affiliated with OpenAI.
Simplifying AI Can Optimize Your Entire Business
Artificial intelligence is becoming less of a futuristic technology and a more integral aspect of today's business landscape. The usage of AI across the business universe is revolutionizing every industry, and Gartner reports that at least 75% of organizations use deep neural networks today. In financial departments, AI is automating menial tasks and reducing errors in traditional manual workflows. There's no doubt that businesses utilizing the right AI for the right reasons are seeing exponential benefits. Unfortunately, not every business unit is as excited about the available AI solutions that finance departments are gifted with.