disinformation researcher raise alarm
Disinformation Researchers Raise Alarms About A.I. Chatbots
Personalized, real-time chatbots could share conspiracy theories in increasingly credible and persuasive ways, researchers say, smoothing out human errors like poor syntax and mistranslations and advancing beyond easily discoverable copy-paste jobs. Soon after ChatGPT debuted last year, researchers tested what the artificial intelligence chatbot would write after it was asked questions peppered with conspiracy theories and false narratives. The results -- in writings formatted as news articles, essays and television scripts -- were so troubling that the researchers minced no words. "This tool is going to be the most powerful tool for spreading misinformation that has ever been on the internet," said Gordon Crovitz, a co-chief executive of NewsGuard, a company that tracks online misinformation and conducted the experiment last month. "Crafting a new false narrative can now be done at dramatic scale, and much more frequently -- it's like having A.I. agents contributing to disinformation."
Disinformation Researchers Raise Alarms About A.I. Chatbots - The New York Times
In 2020, researchers at the Center on Terrorism, Extremism and Counterterrorism at the Middlebury Institute of International Studies found that GPT-3, the underlying technology for ChatGPT, had "impressively deep knowledge of extremist communities" and could be prompted to produce polemics in the style of mass shooters, fake forum threads discussing Nazism, a defense of QAnon and even multilingual extremist texts. OpenAI uses machines and humans to monitor content that is fed into and produced by ChatGPT, a spokesman said. The company relies on both its human A.I. trainers and feedback from users to identify and filter out toxic training data while teaching ChatGPT to produce better-informed responses. OpenAI's policies prohibit use of its technology to promote dishonesty, deceive or manipulate users or attempt to influence politics; the company offers a free moderation tool to handle content that promotes hate, self-harm, violence or sex. But at the moment, the tool offers limited support for languages other than English and does not identify political material, spam, deception or malware.