Goto

Collaborating Authors

 Generative AI


AI delivers a hilarious upgrade to '90s video game characters

#artificialintelligence

AI image generators have been responsible for so much in the last few months. Barely a day goes by without us discovering some new experiment, whether it's resurrecting late celebs or creating a tool so we can turn anyone into a Pokรฉmon. Now someone's used them to enhance what 30 years ago was the cutting edge in video game graphics. An artist has fed the characters of Sega's nineties fighting game Virtua Fighter through an AI image generator, turning the original 3D polygon graphics into photorealistic images (well, almost). To see how this technology works, take a look at our piece on how to use DALL-E 2 (opens in new tab).


Adobe commits to transparency in use of generative AI

#artificialintelligence

Did you miss a session from MetaBeat 2022? Head over to the on-demand library for all of our featured sessions here. Today, at Adobe MAX, billed as the world's largest creativity conference, Adobe announced its commitment to support creatives by ensuring transparency in the use of generative AI tools. In a year dominated by the rise of generative AI tools โ€“ such as OpenAI's DALL-E 2, Google's Imagen, Stable Diffusion and MidJourney โ€“ Adobe, the world's leading computer graphics software company, said its approach to developing creator-centric generative AI offerings would leverage its Content Authenticity Initiative (CAI) standards and invest in new research to support creatives' control over their style and work. The CAI is an Adobe-led initiative that enables creators to securely attach provenance data to digital content, helping ensure creators get credit for their work and audiences understand who made a piece of content and how it was created.


Bizarre Halloween Candy Courtesy of AI Tool Dall-E: Farte Cats, Anyone?

#artificialintelligence

The DIY art world hasn't been the same since the passing of Bob Ross (rest in peace in a forest of happy little trees, king), but AI art creation tool Dall-E at least offers an entertaining and quicker way to generate masterpieces that seem appropriate for the bizarro timeline we all now share. AI researcher Janelle Shane has made a hobby of prompting machine learning systems to engage in the 2022 equivalent of some very weird improv comedy. Her interactions with AI prompt plenty of hilarity -- including a charming set of Valentine's Day cards that almost work. Her latest bit is simply asking Dall-E to paint a picture of the most popular Halloween candies in each US state. The results are filled with lots of odd gibberish and overflowing with candy corn like any child's basket on Nov. 1 when all the good stuff from the previous night's haul has already been devoured. The first thing that becomes clear as Shane starts to work her way through all 50 states alphabetically is that Dall-E associates Halloween treats very strongly with candy corn, which is pretty fair given its abundance this time of year.


Identifiability of deep generative models without auxiliary information

arXiv.org Artificial Intelligence

We prove identifiability of a broad class of deep latent variable models that (a) have universal approximation capabilities and (b) are the decoders of variational autoencoders that are commonly used in practice. Unlike existing work, our analysis does not require weak supervision, auxiliary information, or conditioning in the latent space. Specifically, we show that for a broad class of generative (i.e. unsupervised) models with universal approximation capabilities, the side information $u$ is not necessary: We prove identifiability of the entire generative model where we do not observe $u$ and only observe the data $x$. The models we consider match autoencoder architectures used in practice that leverage mixture priors in the latent space and ReLU/leaky-ReLU activations in the encoder, such as VaDE and MFC-VAE. Our main result is an identifiability hierarchy that significantly generalizes previous work and exposes how different assumptions lead to different "strengths" of identifiability, and includes certain "vanilla" VAEs with isotropic Gaussian priors as a special case. For example, our weakest result establishes (unsupervised) identifiability up to an affine transformation, and thus partially resolves an open problem regarding model identifiability raised in prior work. These theoretical results are augmented with experiments on both simulated and real data.


Mediamorphosis: How AI is enabling a new paradigm for work and play

#artificialintelligence

Did you miss a session from MetaBeat 2022? Head over to the on-demand library for all of our featured sessions here. Text-to-image AI systems such as DALL-E 2, Imagen and Midjourney are growing in popularity and capability right now, offering creators a revolutionary new way to produce content. Generating images from text prompts is a radical new approach to art-making and creative expression. But it also gives us the first glimpse of a fundamental shift in how we can better communicate and collaborate with our machines.


Four thoughts on AI deep learning in 2022

#artificialintelligence

This article is part of a VB special issue. Read the full series here: How Data Privacy Is Transforming Marketing. We're putting another year of exciting developments in artificial intelligence (AI) deep learning behind us โ€“ one filled with remarkable progress, controversies and, of course, disputes. As we wrap up 2022 and prepare to embrace what 2023 has in store, here are some of the most notable overarching trends that marked this year in deep learning. One theme that has remained constant in deep learning over the past few years is the drive to create bigger neural networks.


Microsoft Bing Is Getting An AI Image Generator

#artificialintelligence

Microsoft Bing is getting an AI image generator in the coming weeks, which allows users to turn text into digital art. Let's say a picture of a Shiba Inu as an astronaut would go perfectly with a blog post you're writing. You turn to the search engines for a free-to-use image, but you can't find one that matches your criteria. With the new Image Creator tool coming to Microsoft Bing, you can generate the exact image you need by inputting descriptive text. Image Creator is powered by DALL-E 2 image generator technology developed by OpenAI.


Multi-objective Deep Data Generation with Correlated Property Control

arXiv.org Artificial Intelligence

Developing deep generative models has been an emerging field due to the ability to model and generate complex data for various purposes, such as image synthesis and molecular design. However, the advancement of deep generative models is limited by challenges to generate objects that possess multiple desired properties: 1) the existence of complex correlation among real-world properties is common but hard to identify; 2) controlling individual property enforces an implicit partially control of its correlated properties, which is difficult to model; 3) controlling multiple properties under various manners simultaneously is hard and under-explored. We address these challenges by proposing a novel deep generative framework, CorrVAE, that recovers semantics and the correlation of properties through disentangled latent vectors. The correlation is handled via an explainable mask pooling layer, and properties are precisely retained by generated objects via the mutual dependence between latent vectors and properties. Our generative model preserves properties of interest while handling correlation and conflicts of properties under a multi-objective optimization framework. The experiments demonstrate our model's superior performance in generating data with desired properties.


Microsoft Ignite: 8 Azure AI updates to boost productivity

#artificialintelligence

Did you miss a session from MetaBeat 2022? Head over to the on-demand library for all of our featured sessions here. Today Microsoft Azure announced a variety of enhancements across its AI services at Ignite 2022. The company says the updates will help people "work smarter, not harder by bringing more intelligence, insights and value to the hands of customers." The product updates include new innovations in Azure Applied AI Services to help customers automate mundane tasks and serve end-users in multiple languages worldwide; updates to Azure Cognitive Services to "enrich and simplify" the creation of AI apps with pre-built models and text-to-image generation; and new capabilities in Azure Machine Learning that boost the productivity of developers and data scientists of all skill levels, and help further responsible AI deployment.


What Happens if You Let a 4-Year-Old Use an AI Art Generator?

#artificialintelligence

If you've ever been around kids, you know how limitless their imaginations can be. A simple cardboard box is an endless world of possibilities. What if we let that imagination run wild with DALL-E and Stable Diffusion? That's the amazing thing about the many AI image generators that are available now. Obviously, there are limits to what the AI can do, but you are basically free to enter anything and see what happens.