Goto

Collaborating Authors

 Generative AI


Why foundation models in AI need to be released responsibly

#artificialintelligence

Percy Liang is director of the Center for Research on Foundation Models, a faculty affiliate at the Stanford Institute for Human-Centered AI and an associate professor of Computer Science at Stanford University. Humans are not very good at forecasting the future, especially when it comes to technology. Foundation models are a new class of large-scale neural networks with the ability to generate text, audio, video and images. These models will anchor all kinds of applications and hold the power to influence many aspects of society. It's difficult for anyone, even experts, to imagine where this technology will lead in the coming years.


Shifting machine learning for healthcare from development to deployment and from models to data - Nature Biomedical Engineering

#artificialintelligence

In the past decade, the application of machine learning (ML) to healthcare has helped drive the automation of physician tasks as well as enhancements in clinical capabilities and access to care. This progress has emphasized that, from model development to model deployment, data play central roles. In this Review, we provide a data-centric view of the innovations and challenges that are defining ML for healthcare. We discuss deep generative models and federated learning as strategies to augment datasets for improved model performance, as well as the use of the more recent transformer models for handling larger datasets and enhancing the modelling of clinical text. We also discuss data-focused problems in the deployment of ML, emphasizing the need to efficiently deliver data to ML models for timely clinical predictions and to account for natural data shifts that can deteriorate model performance. This Review discusses the use of deep generative models, federated learning and transformer models to address challenges in the deployment of machine learning for healthcare.


In a Latest ML Paper, OpenAI Researchers Explain How Large-Scale Language Models (LLMs) Trained on Code Open Up a Significant New Kind of Intelligent GP Enabled by ELM that is no longer at the Mercy of the Raw Search Landscape Induced by Code

#artificialintelligence

It has been shown that bootstrapping human expertise and learning from massive datasets may provide excellent results in automated code creation for Large-scale language models (LLMs). Genetic Programming (GP) is a low-resource generating methodology that may be used in conjunction with LLMs based on deep learning to get the best of both worlds. OpenAI researchers show in their new paper Evolution Through Large Models that LLMs trained to generate advanced programming languages can suggest intelligent mutations and that this ability can be helpful to realize massively improved mutation operators for GP. LLMs are taught to develop advanced programming languages. To summarize the study's primary contributions, the researchers say that: Conventional Genetic Programming (GP) uses a mutation range for the operator in order to ensure that the perturbations will have a reasonable likelihood of resulting in beneficial code modifications.


DALL-E, Make Me Another Picasso, Please

The New Yorker

Since humans invented art, sometime in the Paleolithic era, they've produced lots of pictures--"The Starry Night," some memes, that photo of Donald Trump staring at the eclipse. What does it all add up to? A few years ago, a company called OpenAI fed a good deal of those images, along with text descriptions, into the neural network of an artificial intelligence named DALL-E. DALL-E was being trained to create original art of its own, in any style, depicting in uncanny detail almost anything desired, based on written prompts. But a mastery of the entire universe of human imagery makes for difficult choices.


How #ArtificialIntelligence GANs work #dalle2 as an example, and how they will facilitate andโ€ฆ

#artificialintelligence

Artificial intelligence has made a big impact on our lives over the past few years. From online shopping bots to voice assistants like Alexa, we're now more connected than ever before thanks to AI technology. But how can we use AI in other areas? In this article, I will look at how artificial intelligence (AI) can revolutionize the world of art and design by creating new works from scratch and improving existing ones. GANs, or generative adversarial networks, are a type of deep learning model that can generate images, text and other types of data.


Synthetic data getting serious for biometrics training

#artificialintelligence

Synthetic data created by artificial intelligence systems, for AI systems is a growing market, as general adversarial networks (GANs) are used to train facial recognition and other biometric algorithms. The Washington Post profiles a company called Yuty, and the path it took to providing synthetic facial datasets, and reports that it is one of around 50 startups in the space. The Post notes that Gartner has forecast 60 percent of all AI training data will be synthetic by 2024. Amazon recently revealed that it relied heavily on synthetic data to train its palm biometrics. In a similar vein, OpenAI's DALL-E machine learning tool has updated a policy to allow its users to share synthetic facial images, after the tool's developers built in mechanisms to prevent its use in creating deepfakes, according to Vice.


AI2's Unified-IO can complete a range of AI tasks โ€“ TechCrunch

#artificialintelligence

The Allen Institute for AI (AI2), the division within the nonprofit Allen Institute focused on machine learning research, today published its work on an AI system, called Unified-IO, that it claims is among the first to perform a "large and diverse" set of AI tasks. Unified-IO can process and create images, text and other structured data, a feat that the research team behind it says is a step toward building capable, unified general-purpose AI systems. "We are interested in building task-agnostic [AI systems], which can enable practitioners to train [machine learning] models for new tasks with little to no knowledge of the underlying machinery," Jaisen Lu, a research scientist at AI2 who worked on Unified-IO, told TechCrunch via email. "Such unified architectures alleviate the need for task-specific parameters and system modifications, can be jointly trained to perform a large variety of tasks and can share knowledge across tasks to boost performance." AI2's early efforts in building unified AI systems led to GPV-1 and GPV-2, two general-purpose, "vision-language" systems that supported a handful of workloads including captioning images and answering questions.


Introducing Dall-E, The Uncomfortable Robo-Painter

#artificialintelligence

The idea being that in an evolving artistic landscape, technology will play a huge role in the next wave of masterpieces. The hope is that AI will be able to take over for the painstaking grunt work of individual cell shading and clumsy greenscreen for movies, and much more. Currently the application has been limited to a few websites that are in beta mode (essentially a testing phase), a few invitations sent out to curious users who are sharing their findings with the rest of the internet. The results have been a mix of curious, terrifying, and hilarious. Already, a new wave of memes has surged through social media, pictures of clowns on the moon, beloved television shows overrun by dinosaurs, and celebrities eating cheese.


AI Used to Create Shockingly Realistic Portraits of People Who Don't Exist

#artificialintelligence

A photographer has created portraits of people who do not exist but were instead made with the artificial intelligence (AI) program Dall-E 2. Mathieu Stern, a French photographer, used the nascent software that is not yet easily available to the public to create photorealistic portraits of fictitious people that he documented in a YouTube video. Stern, who recently made a series of wild camera designs on the program, started by instructing Dall-E to create an image of "a young beautiful woman wearing a yellow kimono, in a tropical greenhouse." "At first the lack of information about the camera, the lens, and the general look of the image, led to rather unimpressive results," Stern explains on YouTube. "So to help Dall-E, some details must be added to the general description, like the lens, the camera, the film, and adding some words like bokeh." Stern says the best results came after adding the word "Graflex."


How Imagen Actually Works

#artificialintelligence

While the Machine Learning world was still coming to terms with the impressive results of DALL-E 2, released earlier this year, Google upped the ante by releasing its own text-to-image model Imagen, which appears to push the boundaries of caption-conditional image generation even further. Imagen, released just last month, can generate high-quality, high-resolution images given only a description of a scene, regardless of how logical or plausible such a scene may be in the real world. These impressive results no doubt have many wondering how Imagen actually works. In this article, we'll explain how Imagen works at several levels. First, we will examine Imagen from a bird's-eye view in order to understand its high-level components and how they relate to one another. We'll then go into a bit more detail regarding these components, each with its own subsection, in order to understand how they themselves work. Finally, we'll perform a Deep Dive into Imagen that is intended for Machine Learning researchers, students, and practitioners. Without further ado, let's dive in! In the past few years, there has been a significant amount of progress made in the text-to-image domain of Machine Learning. A text-to-image model takes in a short textual description of a scene and then generates an image which reflects the described scene. An example input description (or "caption") and output image can be seen below: It is important to note that high-performing text-to-image models will necessarily be able to combine unrelated concepts and objects in semantically plausible ways.