Goto

Collaborating Authors

 Generative AI


OpenAI will reportedly pay 250 million to put News Corp's journalism in ChatGPT

Engadget

OpenAI and News Corp, the owner of The Wall Street Journal, MarketWatch, The Sun, and more than a dozen other publishing brands, have struck a multi-year deal to display news from these publications in ChatGPT, News Corp announced on Wednesday. OpenAI will be able to access both current and well as archived content from News Corp's publications and use the data to further train its AI models. Neither company disclosed the terms of the deal, but a report in The Wall Street Journal estimated that News Corp would get 250 million over five years in cash and credits. "The pact acknowledges that there is a premium for premium journalism," News Corp Chief Executive Robert Thomson reportedly said in a memo to employees on Wednesday. "The digital age has been characterized by the dominance of distributors, often at the expense of creators, and many media companies have been swept away by a remorseless technological tide. The onus is now on us to make the most of this providential opportunity."


OpenAI and Wall Street Journal owner News Corp sign content deal

The Guardian

ChatGPT developer OpenAI has signed a deal to bring news content from the Wall Street Journal, New York Post, the Times and the Sunday Times to the artificial intelligence platform, the companies said on Wednesday. Neither party disclosed a dollar figure for the deal. The deal will give OpenAI access to current and archived content from all of News Corp's publications. The deal comes weeks after the AI heavyweight signed a deal with the Financial Times to license its content for the development of AI models. Other publications, including the New York Times, have taken a different tack: suing OpenAI and Microsoft, the startup's key backer, over the use of its content to train generative AI and large-language model systems.


Faux ScarJo and the Descent of the A.I. Vultures

The New Yorker

On May 13th, during a live event, the artificial-intelligence company OpenAI unveiled the next generation of its technology, GPT-4o, the successor to GPT-3. When OpenAI first released its product to the public in late 2022, as the text-based tool ChatGPT, it nearly single-handedly ushered in the A.I. era. The latest version is far more powerful still. The "o" in the name stands for "omni"; the model can communicate seamlessly across various forms of media at once, including text, audio, and video, receiving prompts in one medium and responding in another. It can maintain a memory of everything you tell it.


Fox News AI Newsletter: Scarlett Johansson's AI accusation

FOX News

'IN DISBELIEF': Scarlett Johansson is "angered and in disbelief" by tech company OpenAI over its ChatGPT app's voice, Sky, noting it uses a voice very similar to hers. NVIDIA'S RISE: Wall Street is eagerly awaiting the latest earnings report Wednesday from Nvidia, which has experienced rapid growth amid the boom in artificial intelligence technology. ELECTRIC RUNNING ROBOT: Standing as tall as an average human and powered by a symphony of sensors and processors, Tiangong has the ability to jog at a steady pace, navigate complex terrain and perform tasks with precision. Tiangong represents a future where robots could possibly become our companions, helpers and perhaps even our friends. GOOGLE'S BIG REVEALS: Google's flagship developer conference called I/O just wrapped up with interesting leaps in how the Big Tech giant is planning to change the world.


The Low-Paid Humans Behind AI's Smarts Ask Biden to Free Them From 'Modern Day Slavery'

WIRED

AI projects like OpenAI's ChatGPT get part of their savvy from some of the lowest-paid workers in the tech industry--contractors often in poor countries paid small sums to correct chatbots and label images. On Wednesday, 97 African workers who do AI training work or online content moderation for companies like Meta and OpenAI published an open letter to President Biden, demanding that US tech companies stop "systemically abusing and exploiting African workers." Most of the letter's signatories are from Kenya, a hub for tech outsourcing, whose president, William Ruto, is visiting the US this week. The workers allege that the practices of companies like Meta, OpenAI, and data provider Scale AI "amount to modern day slavery." The companies did not immediately respond to a request for comment.


The Download: how criminals use AI, and OpenAI's Chinese data blunder

MIT Technology Review

Artificial intelligence has brought a big boost in productivity--to the criminal underworld. Generative AI provides a new, powerful tool kit that allows malicious actors to work far more efficiently and internationally than ever before. Over the past year, cybercriminals have mostly stopped developing their own AI models. Instead, they are opting for tricks with existing tools that work reliably. That's because criminals want an easy life and quick gains.


What Scarlett Johansson v. OpenAI Could Look Like in Court

WIRED

In a product demo last week, OpenAI showcased a synthetic but expressive voice for ChatGPT called "Sky" that reminded many viewers of the flirty AI girlfriend Samantha played by Scarlett Johansson in the 2013 film Her. One of those viewers was Johansson herself, who promptly hired legal counsel and sent letters to OpenAI demanding an explanation, according to a statement released later. In response, the company on Sunday halted use of Sky and published a blog post insisting that it "is not an imitation of Scarlett Johansson but belongs to a different professional actress using her own natural speaking voice." Johansson's statement, released Monday, said she was "shocked, angered, and in disbelief" by OpenAI's demo using a voice she called "so eerily similar to mine that my closest friends and news outlets could not tell the difference." Johansson revealed that she had turned down a request last year from the company's CEO, Sam Altman, to voice ChatGPT and that he had reached out again two days before last week's demo in an attempt to change her mind.


OpenAI's latest blunder shows the challenges facing Chinese AI models

MIT Technology Review

Add to that another thing OpenAI fumbled with GPT-4o: the data it used to train its tokenizer--a tool that helps the model parse and process text more efficiently--is polluted by Chinese spam websites. As a result, the model's Chinese token library is full of phrases related to pornography and gambling. This could worsen some problems that are common with AI models: hallucinations, poor performance, and misuse. I wrote about it on Friday after several researchers and AI industry insiders flagged the problem. They took a look at GPT-4o's public token library, which has been significantly updated with the new model to improve support of non-English languages, and saw that more than 90 of the 100 longest Chinese tokens in the model are from spam websites.


Second global AI summit secures safety commitments from companies

The Japan Times

Sixteen companies at the forefront of developing artificial intelligence (AI) pledged on Tuesday at a global meeting to develop the technology safely at a time when regulators are scrambling to keep up with rapid innovation and emerging risks. The companies included U.S. leaders Google, Meta, Microsoft and OpenAI, as well as firms from China, South Korea and the United Arab Emirates. They were backed by a broader declaration from the Group of Seven (G7) major economies, the EU, Singapore, Australia and South Korea at a virtual meeting hosted by British Prime Minister Rishi Sunak and South Korean President Yoon Suk-yeol.


From the evolution of public data ecosystems to the evolving horizons of the forward-looking intelligent public data ecosystem empowered by emerging technologies

arXiv.org Artificial Intelligence

Public data ecosystems (PDEs) represent complex socio-technical systems crucial for optimizing data use in the public sector and outside it. Recognizing their multifaceted nature, previous research pro-posed a six-generation Evolutionary Model of Public Data Ecosystems (EMPDE). Designed as a result of a systematic literature review on the topic spanning three decade, this model, while theoretically robust, necessitates empirical validation to enhance its practical applicability. This study addresses this gap by validating the theoretical model through a real-life examination in five European countries - Latvia, Serbia, Czech Republic, Spain, and Poland. This empirical validation provides insights into PDEs dynamics and variations of implementations across contexts, particularly focusing on the 6th generation of forward-looking PDE generation named "Intelligent Public Data Generation" that represents a paradigm shift driven by emerging technologies such as cloud computing, Artificial Intelligence, Natural Language Processing tools, Generative AI, and Large Language Models (LLM) with potential to contribute to both automation and augmentation of business processes within these ecosystems. By transcending their traditional status as a mere component, evolving into both an actor and a stakeholder simultaneously, these technologies catalyze innovation and progress, enhancing PDE management strategies to align with societal, regulatory, and technical imperatives in the digital era.