gpt-4chan
Someone Trained an A.I. With 4chan. Yes, It Could Get Even Worse.
"How do you get a girlfriend?" This exchange would be pretty familiar in the more squalid corners of the internet, but it might surprise most readers to find out that the misogynistic response here was written by an A.I. Recently, a YouTuber in the A.I. community posted a video that explains how he trained an A.I. language model called "GPT-4chan" on the /pol/ board of 4chan, a forum filled with hate speech, racism, sexism, anti-Semitism, and any other offensive content one can imagine. The model was made by fine-tuning the open-source language model GPT-J (not to be confused with the more familiar GPT-3 from OpenAI). Having its language trained by the most vitriolic teacher possible, the designer then unleashed the A.I. on the forum, where it engaged with users and made over 30,000 posts (about 15,000 posted in a single day, which was 10 percent of all posts that day). "By taking away the rights of women" was just one example of GPT-4chan's responses to poster's questions.
- Asia > Russia (0.15)
- North America > United States > Arizona (0.05)
- Europe > Ukraine (0.05)
- Europe > Russia (0.05)
- Government (1.00)
- Information Technology > Security & Privacy (0.96)
- Law > Civil Rights & Constitutional Law (0.88)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.35)
Lessons from the GPT-4Chan Controversy
On June 3rd of 2022, YouTuber and AI researcher Yannic Kilcher released a video about how he developed an AI model named'GPT-4chan', and then deployed bots to pose as humans on the message board 4chan. GPT-4chan is a large language model, and so is essentially trained to'autocomplete' text -- given some text as input, it predicts what text is likely to follow -- by being optimized to mimic typical patterns of text in a bunch of files. In this case, the model was made by fine-tuning GPT-J with a previously published dataset to mimic the users of 4chan's /pol/ anonymous message board; many of these users frequently express racist, white supremacist, antisemitic, anti-Muslim, misogynist, and anti-LGBT views. The model thus learned to output all sorts of hate speech, leading Yannic to call it "The most horrible model on the internet" and to say this in his video: The video also contains the following: a brief set of disclaimers, some discussion of bots on the internet, a high level explanation of how the model was developed, some other thoughts on how good the model is, and a description of how a number of bots powered by the model were deployed to post on the /pol/ message board anonymously. The bots collectively wrote over 30,000 posts over the span of a few days, with 15,000 being posted over a span of 24 hours. Many users were at first confused, but the frequency of posting all over the message board soon led them to conclude this was a bot.
- Law > Civil Rights & Constitutional Law (0.54)
- Law Enforcement & Public Safety > Terrorism (0.54)
Nonsense Sentience, Condemning GPT-4chan, DeepFake Bans, CVPR Plagiarism
This week: LaMDA's Sentience is Nonsense, Condemning the deployment of GPT-4chan, Only 12% of companies are'AI Achievers', EU To Target Big Tech, Over Deepfakes, and more! If you are a fan, we'd appreciate your feedback! Feel free to let us know your thoughts via a review on Apple Podcast, email to contact@lastweekin.ai, or just DM us on Twitter!
Cerebras sets record for largest AI model on a single chip
In brief US hardware startup Cerebras claims to have trained the largest AI model on a single device powered by the world's largest Wafer Scale Engine 2 chip the size of a plate. "Using the Cerebras Software Platform (CSoft), our customers can easily train state-of-the-art GPT language models (such as GPT-3 and GPT-J) with up to 20 billion parameters on a single CS-2 system," the company claimed this week. "Running on a single CS-2, these models take minutes to set up and users can quickly move between models with just a few keystrokes." The CS-2 packs a whopping 850,000 cores, and has 40GB of on-chip memory capable of reaching 20 PB/sec memory bandwidth. The specs on other types of AI accelerators and GPUs pale in comparison, meaning machine learning engineers have to train huge AI models with billions of parameters across more servers.
Fun AI Apps Are Everywhere Right Now. But a Safety 'Reckoning' Is Coming
If you've spent any time on Twitter lately, you may have seen a viral black-and-white image depicting Jar Jar Binks at the Nuremberg Trials, or a courtroom sketch of Snoop Dogg being sued by Snoopy. These surreal creations are the products of Dall-E Mini, a popular web app that creates images on demand. Type in a prompt, and it will rapidly produce a handful of cartoon images depicting whatever you've asked for. More than 200,000 people are now using Dall-E Mini every day, its creator says--a number that is only growing. A Twitter account called "Weird Dall-E Generations," created in February, has more than 890,000 followers at the time of publication.
- Europe > Germany > Bavaria > Middle Franconia > Nuremberg (0.24)
- North America > United States > Texas > Harris County > Houston (0.14)
- Media (1.00)
- Information Technology > Services (0.67)
Is GPT-4chan the worst AI ever?
The bot was trained on three years' worth of posts from 4chan, the repulsive cousin of Reddit. Kilchner fed the bot threads from the Politically Incorrect /pol/ board, a 4chan message board notorious for racist, xenophobic, and hateful content. The bot sparked a heated debate on social media before it went offline. This is the worst AI ever! I trained a language model on 4chan's /pol/ board and the result is…. Watch here (warning: may be offensive):https://t.co/lihsaYAm7l pic.twitter.com/xs7rgtucQb
- North America > Canada > Quebec > Montreal (0.05)
- Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.05)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.74)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.52)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.33)
Fun AI Apps Are Everywhere Right Now. But a Safety 'Reckoning' Is Coming
If you've spent any time on Twitter lately, you may have seen a viral black-and-white image depicting Jar Jar Binks at the Nuremberg Trials, or a courtroom sketch of Snoop Dogg being sued by Snoopy. These surreal creations are the products of Dall-E Mini, a popular web app that creates images on demand. Type in a prompt, and it will rapidly produce a handful of cartoon images depicting whatever you've asked for. More than 200,000 people are now using Dall-E Mini every day, its creator says--a number that is only growing. A Twitter account called "Weird Dall-E Generations," created in February, has more than 890,000 followers at the time of publication.
- Europe > Germany > Bavaria > Middle Franconia > Nuremberg (0.24)
- North America > United States > Texas > Harris County > Houston (0.14)
- Media (1.00)
- Information Technology > Services (0.67)
Oh no... Someone trained an AI on 4chan
If you're concerned about the biases and bigotry of AI models, you're gonna love the latest addition to the ranks: a text generator trained on 4chan's /pol/ board. Short for "Politically Incorrect," /pol/ is a bastion of hate speech, conspiracy theories, and far-right extremism. These attributes attracted Yannick Kilcher, an AI whizz and YouTuber, to use /pol/ as a testing ground for bots. Kilcher first fine-tuned the GPT-J language model on over 134.5 million posts made on /pol/ across three and a half years. He then incorporated the board's thread structure into the system.