Large Language Model
Elon Musk Boosts New Yorker's Sam Altman Exposé on X as Trial Begins
Elon Musk Boosts New Yorker's Sam Altman Exposé on X as Trial Begins The move comes as the trial for Elon Musk's lawsuit against OpenAI kicks off in federal court in Oakland. Elon Musk is boosting a post on X promoting The New Yorker's extensive investigation into Sam Altman's allegedly deceptive behavior, WIRED has confirmed. The move comes just as Musk's lawsuit against OpenAI and Altman heads to a jury trial in a federal courtroom on Monday morning. People scrolling X on Monday reported seeing an April 6 post from Ronan Farrow, a coauthor on the New Yorker article, promoting the investigation. A pop-up on the post on X's mobile app says it was boosted by @elonmusk, who also owns the platform.
OpenAI's GPT-5.5 is faster, smarter, and a step toward its 'super app'
PCWorld reports that OpenAI has launched GPT-5.5, its most advanced AI model, exclusively for paying ChatGPT subscribers on Plus, Pro, Business, and Enterprise plans. The new model delivers faster, more efficient performance in coding, research, and math while outperforming competitors like Google's Gemini 3.1 Pro and Anthropic's Claude Opus 4.7. GPT-5.5 represents a significant step toward OpenAI's'super app' vision, integrating various AI services into one comprehensive platform. OpenAI recently launched GPT-5.5, which the company describes as its most advanced and intuitive AI model to date. The new model is said to be both faster and more efficient, with specific improvements in areas including coding, research, and math. At the same time, it's said to perform better compared to competing models like Google's Gemini 3.1 Pro and Anthropic's Claude Opus 4.7. According to OpenAI co-founder Greg Brockman, GPT-5.5 is also a step towards the company's vision of a future "super app," where services such as ChatGPT, Codex, and an AI-driven web browser are integrated into a single platform, reports TechCrunch . GPT-5.5 is currently rolling out to paying ChatGPT users, which includes those on Plus, Pro, Business, and Enterprise plans. This article originally appeared on our sister publication PC för Alla and was translated and localized from Swedish.
OpenAI breaks out of exclusivity agreements in its partnership with Microsoft
The two companies announced an amended partnership that lets OpenAI use other cloud platforms and offer its models to other companies. OpenAI is opening up its partnership with Microsoft in the latest amendment to the major multi-year collaboration between the tech giants. The latest changes allow OpenAI to offer its latest AI models to other companies and through other cloud providers, stripping Microsoft of its exclusivity rights. In a joint announcement posted on OpenAI and Microsoft's websites, Microsoft will still be OpenAI's primary cloud partner with the latest products shipping first on Azure, but OpenAI is now allowed to use any cloud provider. Sam Altman, OpenAI's CEO, posted on X that the company is now able to make our products and services available across all clouds.
Why Elon Musk and Sam Altman are fighting over OpenAI
Musk, who co-founded the company that created ChatGPT with Altman, wants more than $130 billion in damages in a lawsuit that could shakeup the artificial intelligence landscape. The BBC's Lily Jamali explains why the two tech giants are facing off in court. How much screen time is too much for under fives? Some major retailers and independent stores have introduced AI body scans, CCTV or facial recognition equipment to identify crimes like shoplifting. What does TikTok's deal mean for America's users?
The Download: DeepSeek's latest AI breakthrough, and the race to build world models
The Download: DeepSeek's latest AI breakthrough, and the race to build world models Plus: China has blocked Meta's $2 billion acquisition of AI startup Manus. On Friday, Chinese AI firm DeepSeek released a preview of V4, its long-awaited new flagship model. Notably, the model can process much longer prompts than its last generation, thanks to a new design that handles large amounts of text more efficiently. While the model remains open source, its performance matches leading closed-source rivals from Anthropic, OpenAI, and Google. Here are three ways V4 could shake up AI . AI systems have already gained impressive mastery over the digital world, but the physical world remains humanity's domain.
An eye for an ear: zero-shot audio description leveraging an image captioner with audio-visual token distribution matching
Multimodal large language models have fueled progress in image captioning. These models, fine-tuned on vast image datasets, exhibit a deep understanding of semantic concepts.In this work, we show that this ability can be re-purposed for audio captioning, where the joint image-language decoder can be leveraged to describe auditory content associated with image sequences within videos featuring audiovisual content. This can be achieved via multimodal alignment.Yet, this multimodal alignment task is non-trivial due to the inherent disparity between audible and visible elements in real-world videos. Moreover, multimodal representation learning often relies on contrastive learning, facing the challenge of the so-called modality gap which hinders smooth integration between modalities. In this work, we introduce a novel methodology for bridging the audiovisual modality gap by matching the distributions of tokens produced by an audio backbone and those of an image captioner. Our approach aligns the audio token distribution with that of the image tokens, enabling the model to perform zero-shot audio captioning in an unsupervised fashion. This alignment allows for the use of either audio or audiovisual input by combining or substituting the image encoder with the aligned audio encoder. Our method achieves significantly improved performances in zero-shot audio captioning, compared to existing approaches.
Elon Musk and Sam Altman face off in court over OpenAI's founding mission
The two Silicon Valley tycoons are headed to court. The two Silicon Valley tycoons are headed to court. Musk's lawsuit accuses Altman of fraud, while OpenAI says that Musk is'motivated by jealousy' A lawsuit between two of Silicon Valley's biggest tycoons goes to trial Monday in California, the culmination of a years-long bitter feud. Elon Musk has accused Sam Altman of betraying the founding agreement of the non-profit they started together, OpenAI, by changing it to a for-profit enterprise. Musk accuses Altman, OpenAI, its president Greg Brockman, and its major partner Microsoft of breach of contract and unjust enrichment in the lawsuit.
Large Language Models Are Bad Dice Players: LLMs Struggle to Generate Random Numbers from Statistical Distributions
Zhao, Minda, Du, Yilun, Wang, Mengyu
As large language models (LLMs) transition from chat interfaces to integral components of stochastic pipelines and systems approaching general intelligence, the ability to faithfully sample from specified probability distributions has become a functional requirement rather than a theoretical curiosity. We present the first large-scale, statistically powered audit of native probabilistic sampling in frontier LLMs, benchmarking 11 models across 15 distributions. To disentangle failure modes, we employ a dual-protocol design: Batch Generation, where a model produces $N{=}1000$ samples within one response, and Independent Requests, comprising $N{=}1000$ stateless calls. We observe a sharp protocol asymmetry: batch generation achieves only modest statistical validity, with a 7% median pass rate, while independent requests collapse almost entirely, with 10 of 11 models passing none of the distributions. Beyond this asymmetry, we reveal that sampling fidelity degrades monotonically with distributional complexity and aggravates as the sampling horizon $N$ increases. Finally, we demonstrate how the propagation of these failures into downstream real-world application tasks introduces systematic biases: models fail to enforce uniform answer-position constraints in Multiple Choice Question generation and systematically violate demographic targets in attribute-constrained text-to-image prompt synthesis. These findings indicate that current LLMs lack a functional internal sampler, necessitating external tools for applications requiring statistical guarantees.
Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering
Knowledge-based Visual Question Answering (KB-VQA) requires VQA systems to utilize knowledge from external knowledge bases to answer visually-grounded questions. Retrieval-Augmented Visual Question Answering (RA-VQA), a strong framework to tackle KB-VQA, first retrieves related documents with Dense Passage Retrieval (DPR) and then uses them to answer questions. This paper proposes Fine-grained Late-interaction Multi-modal Retrieval (FLMR) which significantly improves knowledge retrieval in RA-VQA. FLMR addresses two major limitations in RA-VQA's retriever: (1) the image representations obtained via image-to-text transforms can be incomplete and inaccurate and (2) relevance scores between queries and documents are computed with one-dimensional embeddings, which can be insensitive to finer-grained relevance. FLMR overcomes these limitations by obtaining image representations that complement those from the image-totext transforms using a vision model aligned with an existing text-based retriever through a simple alignment network. FLMR also encodes images and questions using multi-dimensional embeddings to capture finer-grained relevance between queries and documents. FLMR significantly improves the original RA-VQA retriever's PRRecall@5 by approximately 8%. Finally, we equipped RA-VQA with two state-of-the-art large multi-modal/language models to achieve 61% VQA score in the OK-VQA dataset.