Generative AI
The Oligarchy Is Afraid of Itself Too
Musk v. Altman is a fight over how much power is too much in Silicon Valley. In May 2016, Elon Musk did something out of character that he has now spent years of his life trying to undo: He made what he believed to be a charitable donation. The world's richest man is also among its stingiest. Musk's private foundation often doles out less than the minimum percentage required by law.
OpenAI Really Wants Codex to Shut Up About Goblins
"Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant," reads OpenAI's coding agent instructions. OpenAI has a goblin problem. Instructions designed to guide the behavior of the company's latest model as it writes code have been revealed to include a line, repeated several times, that specifically forbids it from randomly mentioning an assortment of mythical and real creatures. "Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user's query," read instructions in Codex CLI, a command-line tool for using AI to generate code. It is unclear why OpenAI felt compelled to spell this out for Codex --or indeed why its models might want to discuss goblins or pigeons in the first place.
Sam Altman and Elon Musk Sure Dislike Each Other
The trial between the CEOs makes the AI boom seem sordid and small. Elon Musk and Sam Altman are two of the most influential people in Silicon Valley, if not the world. Between the two of them, Musk and Altman run technology companies worth many trillions of dollars that promise to reshape civilization. But this morning, both sat under fluorescent lights in a courthouse in downtown Oakland, suffering through all manner of technical glitches as their respective attorneys kicked off the long-awaited trial. As Steven Molo, a lawyer for Musk, began his opening argument, confused looks swept the courtroom.
Musk testifies at OpenAI trial it's not OK to 'loot a charity'
Elon Musk has taken the stand at a high-stakes trial over the future of OpenAI, casting his lawsuit against the ChatGPT maker as a defence of charitable giving. The world's richest person is suing OpenAI, its cofounder and chief executive officer, Sam Altman, and its president, Greg Brockman, and said on the stand on Tuesday that they betrayed him and the public by abandoning OpenAI's mission to be a benevolent steward of AI for humanity and transforming the nonprofit into a profit-seeking juggernaut. Musk, who founded carmaker Tesla and rocket company SpaceX, also said he is committed to serving the public by working 80- to 100-hour weeks and generally not taking vacations. "I like working and solving problems that make people's lives better," he said. Before Musk began testifying, Bill Savitt, a lawyer for OpenAI and Altman, told jurors during his opening statement it was Musk who saw dollar signs as he helped finance OpenAI's early growth and pushed it to become a for-profit business, one he might eventually lead as CEO.
Elon Musk Testifies That He Started OpenAI to Prevent a 'Terminator Outcome'
The judge also warned Musk and Sam Altman to curb their "propensity to use social media to make things worse outside the courtroom" after both sides traded attacks online. Elon Musk and Sam Altman appeared in a federal courtroom together for the first time on Tuesday as they fight over OpenAI's decade-long evolution and what it means for the company's future. The trial in Musk's lawsuit against Altman could result in financial damages and, more significantly, governance changes at OpenAI that may complicate its plans for an initial public offering as soon as this year. As the first witness on the stand, Musk immediately sought to frame his case as more than just about OpenAI. Siding with Altman "will give license to looting every charity in America" and shake the "entire foundation of charitable giving," Musk told a panel of nine jurors advising US District Judge Yvonne Gonzalez Rogers on how to rule.
Musk says basis of charitable giving at stake in OpenAI lawsuit
A trial pitting two founders of OpenAI - Sam Altman and Elon Musk - against each other has opened in California, with the sides presenting duelling narratives about the company's history and obligations to consumers. Musk, wearing a dark suit and tie, was asked by one of his lawyers what the lawsuit was about when he took the stand. "It's actually very simple," he said. "It's not okay to steal a charity... If it's okay to loot a charity, the entire foundation of charitable giving will be destroyed."
Anthropic's Little Brother
OpenAI is racing to catch up to its greatest rival. OpenAI does not like to be left out. The week after Anthropic announced Claude Mythos Preview, an AI model that has put governments around the world on edge because of its potential ability to hack into banks, energy grids, and military systems, OpenAI shared a program that is uncannily similar. And just like Anthropic did with its model, OpenAI has, for cybersecurity purposes, restricted access to this new bot, called GPT-5.4-Cyber, to a small group of trusted users. This sequence has become something of a pattern: First Anthropic will make an announcement, and then OpenAI will follow suit.
The Download: Musk and Altman's legal showdown, and AI's profit problem
Plus: OpenAI has ended its exclusive partnership with Microsoft. Elon Musk and Sam Altman are going to court over OpenAI's future. Ahead of OpenAI's IPO, the court could rule on whether the company can exist as a for-profit enterprise. It could even oust its leadership. Musk, an OpenAI co-founder, claims he was deceived into bankrolling the firm under false pretenses. Find out how the trial could upend the global AI race. In a celebrated episode, a community of gnomes sneak out at night to steal underpants.
Musk v Altman: The most toxic row in tech goes on trial
The bitter feud between Elon Musk and OpenAI boss Sam Altman has raged for years, but has mostly played out online in the form of accusations, counter-accusations and jibes. But starting on Tuesday, the beef between the two tech billionaires will shift to a much higher-profile forum: a federal courtroom in California, where their row will be the focus of a month-long trial. Being considered is Musk's claim that Altman - with whom he founded OpenAI - has swindled him out of millions of dollars and reneged on the ChatGPT-maker's original non-profit mission. Musk and Altman themselves will be among those to testify in a case in which the future of AI could be at stake. And while one will presumably emerge the winner, it's plausible that neither will emerge from the saga unscathed.
ProEval: Proactive Failure Discovery and Efficient Performance Estimation for Generative AI Evaluation
Huang, Yizheng; Zeng, Wenjun; Kumaresan, Aditi; Wang, Zi
Evaluating generative AI models is increasingly resource-intensive due to slow inference, expensive raters, and a rapidly growing landscape of models and benchmarks. We propose ProEval, a proactive evaluation framework that leverages transfer learning to efficiently estimate performance and identify failure cases. ProEval employs pre-trained Gaussian Processes (GPs) as surrogates for the performance score function, mapping model inputs to metrics such as the severity of errors or safety violations. By framing performance estimation as Bayesian quadrature (BQ) and failure discovery as superlevel set sampling, we develop uncertainty-aware decision strategies that actively select or synthesize highly informative inputs for testing. Theoretically, we prove that our pre-trained GP-based BQ estimator is unbiased and bounded. Empirically, extensive experiments on reasoning, safety alignment, and classification benchmarks demonstrate that ProEval is significantly more efficient than competitive baselines. It requires 8-65x fewer samples to achieve estimates within 1% of the ground truth, while simultaneously revealing more diverse failure cases under a stricter evaluation budget.
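To make the Bayesian quadrature idea in the abstract concrete, here is a minimal sketch, not the authors' implementation. It assumes a one-dimensional pool of inputs, a fixed RBF kernel standing in for the pre-trained GP, and a simple maximum-variance rule for picking the next input to test; the names rbf_kernel, gp_posterior, bq_estimate, and the toy score function are all hypothetical. The estimate of average performance is the GP posterior mean averaged over the whole pool, computed from only a handful of real evaluations.

```python
import numpy as np

def rbf_kernel(a, b, length_scale=0.2):
    """Squared-exponential kernel between 1-D input arrays a and b."""
    diff = a[:, None] - b[None, :]
    return np.exp(-0.5 * (diff / length_scale) ** 2)

def gp_posterior(x_obs, y_obs, x_pool, noise=1e-6):
    """Posterior mean and variance of a zero-mean GP at every pool point."""
    k_oo = rbf_kernel(x_obs, x_obs) + noise * np.eye(len(x_obs))
    k_po = rbf_kernel(x_pool, x_obs)
    mean = k_po @ np.linalg.solve(k_oo, y_obs)
    # Prior variance is 1 for this kernel; subtract the explained part.
    var = 1.0 - np.einsum("ij,ji->i", k_po, np.linalg.solve(k_oo, k_po.T))
    return mean, np.clip(var, 0.0, None)

def bq_estimate(score_fn, x_pool, budget):
    """Estimate the pool-average score from `budget` actively chosen evaluations."""
    chosen = [len(x_pool) // 2]  # arbitrary starting input (an assumption)
    for _ in range(budget - 1):
        x_obs = x_pool[chosen]
        _, var = gp_posterior(x_obs, score_fn(x_obs), x_pool)
        var[chosen] = -1.0  # never re-test an input already evaluated
        chosen.append(int(np.argmax(var)))  # most uncertain input next
    x_obs = x_pool[chosen]
    mean, _ = gp_posterior(x_obs, score_fn(x_obs), x_pool)
    # Quadrature over a finite pool: average the posterior mean.
    return float(mean.mean()), chosen

# Toy usage: a smooth, made-up "error severity" score over 200 inputs.
x_pool = np.linspace(0.0, 1.0, 200)
toy_score = lambda x: 0.5 + 0.4 * np.sin(6.0 * x)
estimate, chosen = bq_estimate(toy_score, x_pool, budget=12)
exhaustive = float(toy_score(x_pool).mean())
print(f"BQ estimate from 12 evaluations: {estimate:.4f}")
print(f"Exhaustive mean over 200 inputs: {exhaustive:.4f}")
```

The same posterior could in principle drive failure discovery by favoring inputs whose predicted score is likely to exceed a severity threshold, which is one reading of the superlevel set sampling the abstract mentions; that part is not sketched here.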