Generative AI
LogicRank: Logic Induced Reranking for Generative Text-to-Image Systems
Deiseroth, Björn, Schramowski, Patrick, Shindo, Hikaru, Dhami, Devendra Singh, Kersting, Kristian
Text-to-image models have recently achieved remarkable success with seemingly accurate samples in photo-realistic quality. However as state-of-the-art language models still struggle evaluating precise statements consistently, so do language model based image generation processes. In this work we showcase problems of state-of-the-art text-to-image models like DALL-E with generating accurate samples from statements related to the draw bench benchmark. Furthermore we show that CLIP is not able to rerank those generated samples consistently. To this end we propose LogicRank, a neuro-symbolic reasoning framework that can result in a more accurate ranking-system for such precision-demanding settings. LogicRank integrates smoothly into the generation process of text-to-image models and moreover can be used to further fine-tune towards a more logical precise model.
GLM-130B: The most capable AI language model currently available comes from China
A Chinese language model performs better than OpenAI's GPT-3 and Google's PaLM. Huawei shows a Codex alternative. Large AI models for language, code, and images play a central role in the current proliferation of artificial intelligence. Researchers at Stanford University therefore even want to call such models "foundation models." The pioneer in the development of very large AI models is the U.S. AI company OpenAI, whose GPT-3 language model first demonstrated the usefulness of such AI systems.
Picasso VS Robots: The Human Touch Faces Off Against AI - Scoop Empire
Movies over the years showed the rise of artificial intelligence taking over and enslaving humanity, think "The Matrix" and "The Terminator," and while that has yet to happen (thankfully), recently, there has been an attention-grabbing rise of people sharing AI-generated art. Some of whom clearly stated that the pictures were made using AI, while others claimed that its theirs; which made us wonder, will AI steal the art scene from humans? It isn't out of the ordinary to see machines replacing humans. It has happened several times before, such as in car production lines or anything related to mass production, due to the machines' accuracy, effectiveness, and obvious perfect health. However, we've never seen it take over a human's post in something that needed creativity and talent on this scale before. Using AI in art isn't new in fact, there was an AI-generated portrait sold in 2018 at an auction for a whopping $432,500, but the latest two AI platforms making waves in the scene today are Midjourney and DALL·E 2. DALL·E (named as an homage to Salvador Dali) creates images based on the users' input using an intricate deep-learning method; the developers behind the platform enhanced it even more, releasing the second iteration DALL·E 2 in 2021.
Deepfakes: Uncensored AI art model prompts ethics questions – TechCrunch
A new open source AI image generator capable of producing realistic pictures from any text prompt has seen stunningly swift uptake in its first week. Stability AI's Stable Diffusion, high fidelity but capable of being run on off-the-shelf consumer hardware, is now in use by art generator services like Artbreeder, Pixelz.ai and more. But the model's unfiltered nature means not all the use has been completely above board. For the most part, the use cases have been above board. For example, NovelAI has been experimenting with Stable Diffusion to produce art that can accompany the AI-generated stories created by users on its platform.
"The Scariest" after "The Funniest": AI-generated Images Are Crashing the Internet – IndustryWired
The Artificial Intelligence system is trained on millions of real images, which helps them understand the patterns to respond to user queries. The scariest image, it pulled together nightmarish images to deliver the final result. Those images are what nightmares are made of. The AI created several similar images of a monster-like creature that's the stuff of nightmares. The images have been created by Craiyon AI, formerly known as the DALL-E mini AI image generator.
AI Creating 'Art' Is An Ethical And Copyright Nightmare
It's August 2022, and by now you've no doubt read (or more likely seen) something about AI art by now. Whether it's random jokes made for Twitter or paintings that look like they were made by actual human beings, artificial intelligence's ability to create art has exploded onto the scene over the last few months, and while this has been great news for shitposts and fans of tech, it has also raised a number of important questions and concerns. If you haven't read or seen anything about the subject, AI art--or at least as it exists in the state we know it today--is, as Ahmed Elgammal writing in American Scientist so neatly puts it, made when "artists write algorithms not to follow a set of rules, but to'learn' a specific aesthetic by analyzing thousands of images. The algorithm then tries to generate new images in adherence to the aesthetics it has learned." Currently there are a handful of prominent platforms that people are using, with three of the most popular being Midjourney, Dall-E and Stable Diffusion.
Understanding Diffusion Models: A Unified Perspective
Diffusion models have shown incredible capabilities as generative models; indeed, they power the current state-of-the-art models on text-conditioned image generation such as Imagen and DALL-E 2. In this work we review, demystify, and unify the understanding of diffusion models across both variational and score-based perspectives. We first derive Variational Diffusion Models (VDM) as a special case of a Markovian Hierarchical Variational Autoencoder, where three key assumptions enable tractable computation and scalable optimization of the ELBO. We then prove that optimizing a VDM boils down to learning a neural network to predict one of three potential objectives: the original source input from any arbitrary noisification of it, the original source noise from any arbitrarily noisified input, or the score function of a noisified input at any arbitrary noise level. We then dive deeper into what it means to learn the score function, and connect the variational perspective of a diffusion model explicitly with the Score-based Generative Modeling perspective through Tweedie's Formula. Lastly, we cover how to learn a conditional distribution using diffusion models via guidance.
We need to talk about how good AI is getting
For the past few days, I've been playing around with DALL-E 2, an app developed by the San Francisco company OpenAI that turns text descriptions into hyper-realistic images. OpenAI invited me to test DALL-E 2 (the name is a play on Pixar's WALL-E and artist Salvador Dalí) during its beta period, and I quickly got obsessed. I spent hours thinking up weird, funny and abstract prompts to feed the AI -- "a 3D rendering of a suburban home shaped like a croissant," "an 1850s daguerreotype portrait of Kermit the Frog," "a charcoal sketch of two penguins drinking wine in a Parisian bistro." This could be due to a conflict with your ad-blocking or security software. Please add japantimes.co.jp and piano.io to your list of allowed sites.