Generative AI
Colossal-AI Seamlessly Accelerates Large Models at Low Costs with Hugging Face
Forbes News, the world's leading voice, recently declared large AI models as one of six AI trends to watch for in 2022. As large-scale AI models continue their superior performances across different domains, trends emerge, leading to distinguished and efficient AI applications that have never been seen in the industry. For example, Microsoft-owned GitHub and OpenAI partnered to launch Copilot recently. Copilot plays the role of an AI pair programmer, offering suggestions for code and entire functions in real-time. Such developments continue to make coding easier than before.
AI21 Labs raises $64M to help it compete against OpenAI
AI21 Labs has raised $64 million in a funding round to help it compete against OpenAI and other NLP leaders. Competition in NLP (Natural Language Processing) is heating up. OpenAI is currently seen as the industry leader with its GPT-3 model but rivals are gaining traction. Investors see AI21 Labs as one of the most promising contenders. "We completed this round during a period of market uncertainty, which highlights the confidence our investors have in AI21's vision to change the way people consume and produce information," said Ori Goshen, Co-Founder and Co-CEO of AI21 Labs.
Attacking Machine Learning with Adversarial Examples
Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake; they're like optical illusions for machines. In this post we'll show how adversarial examples work across different mediums, and will discuss why securing systems against them can be difficult. At OpenAI, we think adversarial examples are a good aspect of security to work on because they represent a concrete problem in AI safety that can be addressed in the short term, and because fixing them is difficult enough that it requires a serious research effort. To get an idea of what adversarial examples look like, consider this demonstration from Explaining and Harnessing Adversarial Examples: starting with an image of a panda, the attacker adds a small perturbation that has been calculated to make the image be recognized as a gibbon with high confidence. The approach is quite robust; recent research has shown adversarial examples can be printed out on standard paper then photographed with a standard smartphone, and still fool systems.
When AI Makes Art, Humans Supply the Creative Spark
New products often come with disclaimers, but in April the artificial intelligence company OpenAI issued an unusual warning when it announced a new service called DALL-E 2. The system can generate vivid and realistic photos, paintings, and illustrations in response to a line of text or an uploaded image. One part of OpenAI's release notes cautioned that "the model may increase the efficiency of performing some tasks like photo editing or production of stock photography, which could displace jobs of designers, photographers, models, editors, and artists." So far, that hasn't come to pass. People who have been granted early access to DALL-E have found that it elevates human creativity rather than making it obsolete. Benjamin Von Wong, an artist who creates installations and sculptures, says it has, in fact, increased his productivity. "DALL-E is a wonderful tool for someone like me who cannot draw," says Von Wong, who uses the tool to explore ideas that could later be built into physical works of art.
La veille de la cybersécurité
Unlike other, more famous large language models such as OpenAI's GPT-3 and Google's LaMDA, BLOOM (which stands for BigScience Large Open-science Open-access Multilingual Language Model) is designed to be as transparent as possible, with researchers sharing details about the data it was trained on, the challenges in its development, and the way they evaluated its performance. OpenAI and Google have not shared their code or made their models available to the public, and external researchers have very little understanding of how these models are trained. BLOOM was created over the last year by over 1,000 volunteer researchers in a project called BigScience, which was coordinated by AI startup Hugging Face using funding from the French government. It officially launched on July 12. The researchers hope developing an open-access LLM that performs as well as other leading models will lead to long-lasting changes in the culture of AI development and help democratize access to cutting-edge AI technology for researchers around the world.
ConvGeN: Convex space learning improves deep-generative oversampling for tabular imbalanced classification on smaller datasets
Schultz, Kristian, Bej, Saptarshi, Hahn, Waldemar, Wolfien, Markus, Srivastava, Prashant, Wolkenhauer, Olaf
Data is commonly stored in tabular format. Several fields of research are prone to small imbalanced tabular data. Supervised Machine Learning on such data is often difficult due to class imbalance. Synthetic data generation, i.e., oversampling, is a common remedy used to improve classifier performance. State-of-the-art linear interpolation approaches, such as LoRAS and ProWRAS can be used to generate synthetic samples from the convex space of the minority class to improve classifier performance in such cases. Deep generative networks are common deep learning approaches for synthetic sample generation, widely used for synthetic image generation. However, their scope on synthetic tabular data generation in the context of imbalanced classification is not adequately explored. In this article, we show that existing deep generative models perform poorly compared to linear interpolation based approaches for imbalanced classification problems on smaller tabular datasets. To overcome this, we propose a deep generative model, ConvGeN that combines the idea of convex space learning with deep generative models. ConvGeN learns the coefficients for the convex combinations of the minority class samples, such that the synthetic data is distinct enough from the majority class. Our benchmarking experiments demonstrate that our proposed model ConvGeN improves imbalanced classification on such small datasets, as compared to existing deep generative models, while being at-par with the existing linear interpolation approaches. Moreover, we discuss how our model can be used for synthetic tabular data generation in general, even outside the scope of data imbalance and thus, improves the overall applicability of convex space learning.
Inside a radical new project to democratize AI
Unlike other, more famous large language models such as OpenAI's GPT-3 and Google's LaMDA, BLOOM (which stands for BigScience Large Open-science Open-access Multilingual Language Model) is designed to be as transparent as possible, with researchers sharing details about the data it was trained on, the challenges in its development, and the way they evaluated its performance. OpenAI and Google have not shared their code or made their models available to the public, and external researchers have very little understanding of how these models are trained. BLOOM was created over the last year by over 1,000 volunteer researchers in a project called BigScience, which was coordinated by AI startup Hugging Face using funding from the French government. It officially launched on July 12. The researchers hope developing an open-access LLM that performs as well as other leading models will lead to long-lasting changes in the culture of AI development and help democratize access to cutting-edge AI technology for researchers around the world.
Where is 'I' in 'AI' anymore?
Last month, a group of Cosmopolitan editors, alongside digital artist Karen X. Cheng and members of artificial intelligence research lab OpenAI, created the first-ever magazine cover designed by artificial intelligence. This is the first-ever magazine cover generated using DALLE-2. Words I never thought I'd be saying? An image I generated is the cover of @cosmopolitan for their first ever AI-generated magazine cover #dalle #dalle2 pic.twitter.com/x2oqiNMRVx Recently, OpenAI's GPT-3 also published a research thesis on itself.
An AI was told to design the Apple Car. This is what it made… - Yanko Design
The results may look fascinating, but what's cooler is that this comes from OpenAI's DALL-E 2, founded by Elon Musk. So in a way, credit for this Apple Car goes to Tesla's Elon Musk?! Mmm?? Designed by Dall-E 2 based on a text prompt from designer, educator, and YouTuber John Mauriello, this Apple Car is fascinating for two prime reasons – the car's design itself, but more importantly, the underlying AI technology that ended up creating the car. The genesis for this idea came from Marques Brownlee's own efforts with DALL-E 2. In a YouTube video, Brownlee demonstrated how simply typing the words "Apple Car" resulted in a car that looked like the apple fruit. This became a starting point for Mauriello, who instead, decided to tweak the prompt a little to get more specialized results. Mauriello told the AI to design a "Minimalist Sportscar inspired by a MacBook and a Magic Mouse, built out of aluminum and glass", while also specifying it to design something in the style of Apple's former design head, Jony Ive.
From 'Barbies scissoring' to 'contorted emotion': the artists using AI
You type in words – however nonsensical or disjointed – and the algorithm creates a unique image based on your search. This is Dall-E 2, a startlingly advanced, image-generating AI trained on 250 million images, named after the surrealist artist Salvador Dalí and Pixar's Wall-E. While use of Dall-E 2 is currently limited to a narrow pool of people, Dall-E mini (or Craiyon) is a free, unrelated version that is open to the public. Drawing on 15m images, Dall-E mini's algorithm offers a smorgasbord of surreal images, complete with absurd compositions and blurred human forms. Already, trends have emerged: nuclear explosions, dumpster fires, toilets and giant eyeballs abound. On a dedicated Reddit thread, people delight in the images generated by the free, low-resolution version, which range from amusing (Kim Jong-un lego) to dark (The Last Supper by Salvador Dali), hellish (synchronized swimming in lava) and deeply disturbing (Steve Jobs introducing a guillotine). Like other machine-learning networks, this AI model seems biased in its images of people – who appear, perhaps unsurprisingly, overwhelmingly white and mostly male.