Goto

Collaborating Authors

 Generative AI


openai/CLIP

#artificialintelligence

CLIP (Contrastive Language-Image Pre-Training) is a neural network trained on a variety of (image, text) pairs. It can be instructed in natural language to predict the most relevant text snippet, given an image, without directly optimizing for the task, similarly to the zero-shot capabilities of GPT-2 and 3. We found CLIP matches the performance of the original ResNet50 on ImageNet "zero-shot" without using any of the original 1.28M labeled examples, overcoming several major challenges in computer vision. First, install PyTorch 1.7.1 and torchvision, as well as small additional dependencies, and then install this repo as a Python package. Returns the model and the TorchVision transform needed by the model, specified by the model name returned by clip.available_models(). The name argument can also be a path to a local checkpoint.


OpenAI's text-generating system GPT-3 is now spewing out 4.5 billion words a day

#artificialintelligence

One of the biggest trends in machine learning right now is text generation. AI systems learn by absorbing billions of words scraped from the internet and generate text in response to a variety of prompts. It sounds simple, but these machines can be put to a wide array of tasks -- from creating fiction, to writing bad code, to letting you chat with historical figures. The best-known AI text-generator is OpenAI's GPT-3, which the company recently announced is now being used in more than 300 different apps, by "tens of thousands" of developers, and producing 4.5 billion words per day. This may be an arbitrary milestone for OpenAI to celebrate, but it's also a useful indicator of the growing scale, impact, and commercial potential of AI text generation.


OpenAI's Sam Altman: Artificial Intelligence will generate enough wealth to pay each adult $13,500 a year

#artificialintelligence

Artificial intelligence will create so much wealth that every adult in the United States could be paid $13,500 per year from its windfall as soon as 10 years from now. So says Sam Altman, co-founder and president of San Francisco-headquartered, artificial intelligence-focused nonprofit OpenAI. "My work at OpenAI reminds me every day about the magnitude of the socioeconomic change that is coming sooner than most people believe," Altman, who posted Tuesday. "Software that can think and learn will do more and more of the work that people now do." Altman calls it an "AI revolution," and compares it in magnitude to the agricultural, industrial and computational technological revolutions.


Artificial intelligence researchers rank the top A.I. labs worldwide

#artificialintelligence

Artificial intelligence researchers don't like it when you ask them to name the top AI labs in the world, possibly because it's so hard to answer. There are some obvious contenders when it comes to commercial AI labs. U.S. Big Tech -- Google, Facebook, Amazon, Apple and Microsoft -- have all set up dedicated AI labs over the last decade. There's also DeepMind, which is owned by Google parent company Alphabet, and OpenAI, which counts Elon Musk as a founding investor. "Wow, I hate this question," Mark Riedl, associate professor at the Georgia Tech School of Interactive Computing, told CNBC when asked to pick his standouts.


Python Code Assistant Powered by GPT-3

#artificialintelligence

GPT-3 from OpenAI has captured public attention unlike any other AI model in the 21st century. The sheer flexibility of the model in performing a series of generalized tasks with near-human efficiency and accuracy is what makes it so exciting. It has created a paradigm shift in the world of Natural Language Processing(NLP), where till now the models were trained based on the ungenralized approach to excel at one or two tasks. GPT-3 is trained by OpenAI with a generalized approach on a massive scale involving 175 billion parameters which allows it to mimic functionalities of the human brain (like GPT-3 is capable of generating text that is surprisingly human-like after only being fed a few examples of the task you want it to do). Like a human brain GPT-3 is able to learn and do things with few shots of training unlike the conventional way of training an NLP model over a large corpus, which is both difficult and time-consuming.


The Power of Scale

#artificialintelligence

On May 2020, OpenAI introduced GPT-3 which is the third iteration of GPT language generation model series. The model boasted of a capacity of 175 billion parameters, more than 10 times than any other language model created before it. And to say that the model was a significant improvement would be an understatement. It could write essays on any topic without any inconsistencies in the output text. When trained on code samples, it could generate small code snippets (which were actually functional!) by getting a description of the task in English by the user. A group of developers trained it to update financial statements on Microsoft Excel based on casual description of transactions.



Multimodal Neurons in Artificial Neural Networks

#artificialintelligence

We've discovered neurons in CLIP that respond to the same concept whether presented literally, symbolically, or conceptually. This may explain CLIP's accuracy in classifying surprising visual renditions of concepts, and is also an important step toward understanding the associations and biases that CLIP and similar models learn. Fifteen years ago, Quiroga et al. discovered that the human brain possesses multimodal neurons. These neurons respond to clusters of abstract concepts centered around a common high-level theme, rather than any specific visual feature. The most famous of these was the "Halle Berry" neuron, a neuron featured in both Scientific American and The New York Times, that responds to photographs, sketches, and the text "Halle Berry" (but not other names).


OpenAI's state-of-the-art machine vision AI is fooled by handwritten notes

#artificialintelligence

Researchers from machine learning lab OpenAI have discovered that their state-of-the-art computer vision system can be deceived by tools no more sophisticated than a pen and a pad. As illustrated in the image above, simply writing down the name of an object and sticking it on another can be enough to trick the software into misidentifying what it sees. "We refer to these attacks as typographic attacks," write OpenAI's researchers in a blog post. "By exploiting the model's ability to read text robustly, we find that even photographs of hand-written text can often fool the model." They note that such attacks are similar to "adversarial images" that can fool commercial machine vision systems, but far simpler to produce.


DALL-E Makes Creative Images From Text: Science Fiction in the News

#artificialintelligence

Dall-E is a model based on GPT-3 that generates images using a short text caption. The name "DALL-E" is a portmanteau of the artist Salvador Dal-- and Pixar--s WALL--E . For example, if you asked for an armchair like an avocado, what might it look like?