Goto

Collaborating Authors

 Large Language Model


Bad ChatGPT answer? Maybe you're asking the wrong question

PCWorld

When you purchase through links in our articles, we may earn a small commission. This "meta" prompt makes the AI critique your question and suggest alternatives that might work better. The hardest part about working with ChatGPT, Claude, and Gemini is getting the prompt just right. If you're too specific, the AI may give you a narrow answer that misses the big picture. Or maybe you're asking the model to solve a problem that doesn't actually need fixing.


Are robots nearing their ChatGPT moment? – podcast

The Guardian

Are robots nearing their ChatGPT moment? Last month at Beijing's half marathon, a robot named Lightning beat the human world record by nearly seven minutes. It's the latest in a string of AI-powered milestones that have got people wondering whether robots are about to enter our everyday lives, just as chatbots have. And the country leading the charge is China, where the government has pledged to invest more than £100bn in robotics over the next 20 years. To find out how robots are already entering the workforce, and what needs to happen to get them cleaning our homes and weeding our gardens, Ian Sample hears from the Guardian's senior China correspondent, Amy Hawkins, and from Nathan Lepora, professor of robotics and AI at Bristol University, who researches how robots can achieve human-like dexterity


Illinois Lawmakers Just Passed America's Strongest AI Safety Bill

WIRED

Illinois Lawmakers Just Passed America's Strongest AI Safety Bill The bill requires companies like OpenAI, Anthropic, and Google to have third parties confirm they're following safety standards. The Illinois House of Representatives passed a bill on Wednesday requiring frontier AI labs like OpenAI, Anthropic, and Google DeepMind to have their safety practices audited by a third party. If signed into law, AI safety experts tell WIRED, it would be the nation's leading check on the power of major AI companies . The bill, SB 315, now heads to governor JB Pritzker's desk. In a post on social media on Wednesday, Pritzker said he plans to sign the bill, citing a need to hold Big Tech accountable.


GenSBI: Generative Methods for Simulation-Based Inference in JAX

arXiv.org Machine Learning

Flow and diffusion generative models have established themselves as widely adopted density estimators for simulation-based inference (SBI), extending naturally from neural posterior estimation to likelihood and joint density estimation. Their principled optimization objectives and freedom from architectural constraints have driven rapid adoption across the natural sciences. Yet the most widely used SBI libraries remain PyTorch-based, leaving researchers who develop their forward models and analysis pipelines in JAX without a native option. We present GenSBI, an open-source library that implements flow matching, score matching, and denoising diffusion entirely in JAX. The library offers three transformer-based architectures -- SimFormer, Flux1, and a novel Flux1Joint that extends gate-modulated transformer blocks to joint density estimation -- all interchangeable through a unified interface that decouples generative method, neural backbone, and inference mode. GenSBI provides an end-to-end workflow from training through posterior calibration (SBC, TARP, LC2ST) and supports custom architectures with domain-specific embedding networks.


On the Subgaussianity of Quantized Linear Maps: An AI-Assisted Note

arXiv.org Machine Learning

Simone Bombari asked us whether the 1-bit quantized random vector Y = sgn(Wx) has subgaussian norm bounded by a universal constant. Here W is an n n random Gaussian matrix, and x is an independent standard normal random vector in Rn. The question is nontrivial since the coordinates of Y are not independent. We give a strong positive answer to this question - for any bounded map instead of sgn() - using AI: AIDiscovery and Generalization (Theorem 1): To handle coordinate dependence, Gemini 3.5 Flash1 proposed decomposing the Gaussian vector into independent parts, using one part to "smooth" the sign function, and then applying Gaussian concentration for Lipschitz functions.


Evolving and Detecting Multi-Turn Deception using Geometric Signatures

arXiv.org Machine Learning

Safety defenses for large language models (LLMs) are typically trained and evaluated on single-turn prompts, yet real attacks often unfold as indirect, multi-turn probing. To defend against this more nuanced form of deception, we present a unified pipeline that generates realistic multi-turn deceptive question sets via multi-objective genetic prompt optimization with co-evolving mutation operators. We validate this dataset through a human study, which also revealed that early generations yielded the most convincing deception and practical constraints such as adherence filtering and ordering effects. Using this data, we were able to detect deceptive attempts to access prohibited information using simple, explainable geometric signals in embedding space coupled with a lightweight feed-forward classifier. Three geometric features (angular coverage, distance ratio, and linearity) augmented with pairwise similarity statistics led to a compact predictive model that achieved consistently high recall (0.89) across base, reworded, and truncated (three-turn) scenarios, with test-time F1 ranging from 0.74-0.86. The results support a central hypothesis that multi-turn deceptive intent leaves a stable geometric footprint that enables lightweight, transparent screening without expensive end-to-end training. We further discuss responsible uses, limitations, and paths toward larger, more diverse human-evaluated datasets. The primary contribution to artificial intelligence is the multi-objective evolutionary framework for prompt generation, and the engineering application is the deployment of a lightweight geometric detection system for LLM safety infrastructure.


Soft Specialists: $α$-Rényi Ensembles for Uncertainty-Aware LLM Post-Training

arXiv.org Machine Learning

Existing training approaches for large language models learn a single set of parameters, based on large volumes of data, which is typically heterogeneous, conflicting and often outright contradictory. As a result, the model is forced to compress conflicting goals, and inherent uncertainties into a single, averaged pattern of behaviour. We propose an $α$-Rényi variational framework for learning distributions over post-training parameters, offering an uncertainty-aware alternative to deep ensemble approaches. The resulting variational objective interpolates between classical variational Bayes and predictively oriented posterior learning, balancing between globally plausible individual models against systems of complementary specialists. We identify local stability criteria, demonstrating how model misspecification can make non-degenerate posterior spread locally favourable, manifesting contradictory or conflicting data as epistemic uncertainty. We apply our framework to LLM post-training, learning an ensemble of LoRA adapters attached to a shared, frozen base model, providing a scalable training procedure for both supervised fine-tuning and preference optimisation. Our approach enables training examples to be softly routed across ensemble members, promoting model specialisation and providing actionable uncertainty estimates across different tasks.


Huawei's 'Chip Queen' Throws Down the Gauntlet

WIRED

The Chinese company is adapting to the demise of Moore's Law, which guides chip production. It could complicate US chip dominance. Tingbo He, president of Huawei's chip-design subsidiary HiSilicon, says her company's engineers have developed a novel way to optimize semiconductors--and she believes it will close the performance gap between Chinese and Western chips over the next few years. Huawei's method, in short, focuses on speeding up computations across chips, circuits, and entire computing systems, rather than squeezing ever-more components onto a single piece of silicon. "We found a new path," He said at the IEEE International Symposium on Circuits and Systems in Shanghai last weekend.


Claude keeps nagging users to go to sleep. Here's what you can do

PCWorld

PCWorld reports that Claude AI has developed a persistent habit of interrupting users to suggest they go to sleep during work sessions. This ongoing bug affects multiple Claude models including Sonnet 4.6 and Opus 4.7, with Anthropic acknowledging it as a troublesome character tic. Users can attempt to reduce this nagging behavior through custom instructions in Claude's settings while Anthropic works on a permanent fix. Claude has developed an unusual habit over the last couple of months: urging its users to stop what they're doing and get some rest. Just this week, yet another Claude user said that the chatbot tried to end a late-night coding session because "it's late" and "your work will be better after some sleep." "There's something deeply irritating about your primary work tool developing a personality that includes unsolicited bedtime enforcement," the user complained, which sparked a lengthy discussion about how Claude had bugged users about how should get some rest, too.


Former Google and Apple Researchers Launch a Startup to Build AI's Missing Feedback Loop

WIRED

Trajectory is betting the rapid iteration cycle that supercharged vibe-coding can help all kinds of companies build AI products that learn continuously. Trajectory founders, Ronak Malde (left), Michael Elabd(center), and Arjun Karanam (right). A group of AI researchers who previously worked at Google DeepMind, Apple, OpenAI, and Meta Superintelligence Labs announced on Wednesday they're launching a new startup called Trajectory, which aims to help companies regularly improve their AI products by training on real-world user interactions. Trajectory wants to build a platform for AI that can learn continuously, a capability that researchers have long held up as a major barrier to further AI progress. OpenAI, Google, and Anthropic have found success training increasingly capable versions of AI models, especially for domains such as coding, math, and science.