Generative AI
A college kid used AI to create a fake blog. It reached #1 on Hacker News.
GPT-3 is OpenAI's latest and largest language AI model, which the San Francisco–based research lab began drip-feeding out in mid-July. In February of last year, OpenAI made headlines with GPT-2, an earlier version of the algorithm, which it announced it would withhold for fear it would be abused. The decision immediately sparked a backlash, as researchers accused the lab of pulling a stunt. By November, the lab had reversed position and released the model, saying it had detected "no strong evidence of misuse so far." The lab took a different approach with GPT-3; it neither withheld it nor granted public access.
GPT-3, explained: This new language AI is uncanny, funny -- and a big deal
Last month, OpenAI, the Elon Musk-founded artificial intelligence research lab, announced the arrival of the newest version of an AI system it had been working on that can mimic human language, a model called GPT-3. In the weeks that followed, people got the chance to play with the program. If you follow news about AI, you may have seen some headlines calling it a huge step forward, even a scary one. I've now spent the past few days looking at GPT-3 in greater depth and playing around with it. I'm here to tell you: The hype is real. It has its shortcomings, but make no mistake: GPT-3 represents a tremendous leap for AI. A year ago I sat down to play with GPT-3's precursor dubbed (you guessed it) GPT-2.
OpenAI GPT-3: How It Works and Why It Matters - DZone AI
You have probably heard about an innovative language model called GPT3. The hype is so overwhelming that we decided to research its core and the consequences for the tech players. Let's explore whether the language deserves this much attention and what makes it so exceptional. GPT-3 is a text generating neural network that was released in June 2020 and tested for $14 million. Its creator is the AI research agency OpenAI headed by Sam Altman, Marc Benioff, Elon Musk, and Reid Hoffman. The language is based on 175 million parameters and is by far more accurate than its predecessors.
Exploring GPT-3: A New Breakthrough in Language Generation - KDnuggets
It seems like only last year that we were arguing about whether the slow-release rollout of the 1.5 billion parameter Generative Pretrained Transformer-2 (GPT-2) was reasonable. If the debate seems recent, that's because it is (writing from 2020): The notorious GPT-2 model was announced by OpenAI in February 2019, but it wasn't fully released until nearly 9 months later (although it was replicated before that). The release schedule was admittedly somewhat experimental, meant more to foster discussion of responsible open publishing, rather than a last-ditch effort to avert an AI apocalypse. All that is a bit moot by now because not only has OpenAI trained a much larger language model in GPT-3, but you can sign up to access it through their new API. Comparing GPT-3 to GPT-2 is like comparing apples to, well, raisins, because the model is about that much larger.
The Guardian view on artificial intelligence's revolution: learning but not as we know it
Bosses don't often play down their products. Sam Altman, the CEO of artificial intelligence company OpenAI, did just that when people went gaga over his company's latest software: the Generative Pretrained Transformer 3 (GPT-3). For some, GPT-3 represented a moment in which one scientific era ends and another is born. Mr Altman rightly lowered expectations. "The GPT-3 hype is way too much," he tweeted last month.
OpenAI Microscope
We're introducing OpenAI Microscope, a collection of visualizations of every significant layer and neuron of eight vision "model organisms" which are often studied in interpretability. Microscope makes it easier to analyze the features that form inside these neural networks, and we hope it will help the research community as we move towards understanding these complicated systems. The abilities of modern neural networks are the result of the interactions of thousands of neurons (sometimes tens of thousands or more!). In order to understand their behavior, we'd like to be able to quickly and easily investigate these neurons interactions in detail, and share those observations. This is especially true in collaborative environments.
15 Interesting Ways OpenAI's GPT-3 Has Been Put To Use
First, you must know that the sun is actually a cat. Also, you must know that the sun is actually not a cat. Over the past couple of weeks, the ML community had their handsful discussing and displaying the wide range of utilities of GPT-3. Many developers, both professionals and amateurs, have expressed their surprise saying how most of the demos generated using GPT-3 in a few minutes would usually require significant engineering effort and machine learning expertise. In the next section, we list 15 exciting ways in which GPT-3 has been leveraged.
GPT-3 has its Breakthroughs as Well as Flaws
GPT-3 is a language model that is automated by a neural system, launched by OpenAI in July 2020. It's a text generator that can compose articles, poetry, sentiment essays, and working code--which is the reason it has the entire world humming, some with excitement, some with skepticism. The previous GPT model had 1.5 billion parameters and was the biggest model in those days, which was before long overshadowed by NVIDIA's Megatron, with 8 billion parameters followed by Microsoft's Turing NLG that had 17 billion parameters. Presently, OpenAI changes the situation by deploying a model that is 10 times bigger than Turing NLG. Current NLP frameworks still to a great extent struggle to learn from a couple of models.
Shrinking deep learning's carbon footprint – Tech Check News
In June, OpenAI unveiled the largest language model in the world, a text-generating tool called GPT-3 that can write creative fiction, translate legalese into plain English, and answer obscure trivia questions. It's the latest feat of intelligence achieved by deep learning, a machine learning method patterned after the way neurons in the brain process and store information. But it came at a hefty price: at least $4.6 million and 355 years in computing time, assuming the model was trained on a standard neural network chip, or […]