Cambridge
Replace or Reshape: How AI Could Change the Way We Work
Christopher Marquis is a professor at the University of Cambridge and the author of The Profiteers. In 1930, in the depths of the Great Depression, John Maynard Keynes wrote a short essay called . It is often remembered for one striking prediction: by 2030, people in wealthy countries might only need to work about 15 hours a week. What Keynes imagined was a society advanced enough to solve what he called the "economic problem" of basic material provision. If technology kept improving, and societies kept growing richer, then fewer hours of human labor would be needed to produce the necessities and comforts of life.
Image Empire – a new short film from Alan Warburton
The film forms part of a research project undertaken by Alan Warburton which also includes a research paper and a series of satellite events. The film is based on doctoral research undertaken at Birkbeck's Vasari Centre for Art & Technology. It was commissioned by the National Videogame Museum in collaboration with the Open Data Institute (ODI) and Cambridge University's Leverhulme Centre for the Future of Intelligence . The ODI hosted a webinar on 6 May to discuss the content of the film. The panellists explored what AI can and can't do, what effects a collapse of real and virtual could have on visual culture, and if we're living in a post-truth world.
Three Costs of Amortizing Gaussian Process Inference with Neural Processes
Neural processes amortize Gaussian process inference, replacing the exact $O(n^3)$ posterior with a learned $O(n)$ map from context sets to predictive distributions. For a class of latent neural processes, we bound the Kullback--Leibler (KL) divergence between the GP and LNP predictives, decomposing it into three interpretable sources, namely label contamination as the neural process uses label values to estimate a quantity that is label-independent in the exact GP, an information bottleneck because the finite-dimensional representation cannot resolve the full context geometry, and amortization error from a single encoder network shared across all contexts. The bottleneck truncation term decays in the representation dimension $d$ as $O(e^{-cd^{2/d_x}})$ for squared-exponential kernels on $\mathbb{R}^{d_x}$ where $c > 0$ is a kernel-dependent constant and as $O(d^{-2ν/d_x})$ for Matérn-$ν$ kernels, directly linking architecture sizing to kernel smoothness and input dimension. The label contamination term is $O(1)$ in general, with only the observation-noise component decaying as $O(1/n)$, identifying a persistent cost of routing uncertainty estimation through a label-dependent representation. These results characterize the costs of amortization within the analyzed class and yield architectural recommendations to predict variance from context locations alone in the GP-amortization regime, and replace mean aggregation with second-order pooling to close the dominant amortization gap.
Aerodynamic force reconstruction using physics-informed Gaussian processes
Tondo, Gledson Rodrigo, Kavrakov, Igor, Morgenthal, Guido
Accurate modeling of aerodynamic loads is essential for understanding and predicting the responses of complex structural systems. However, these models often rely on simplifications of the true physical forces, introducing assumptions that can limit their accuracy. Validating such models becomes particularly challenging in the presence of noisy or incomplete data. To address this, we introduce a probabilistic physics-informed machine learning approach designed to reconstruct the underlying aerodynamic loads from noisy measurements of structural dynamic responses. The model avoids overfitting, eliminates the need for regularization schemes, and allows for the use of heterogeneous and multi-fidelity data during the training process. The efficacy of the approach is demonstrated through the reconstruction of aerodynamic loads on the Great Belt East Bridge, simulated under a linear unsteady assumption. Results show a strong agreement between true and predicted loads, particularly related to root mean squared errors, magnitude, phase angle and peak values of the signals. The method for load reconstructing holds broad applicability, such as modeling validation, future load estimation, and structural damage prognosis.
Mathematicians stunned by AI's biggest breakthrough in mathematics yet
Mathematicians stunned by AI's biggest breakthrough in mathematics yet An 80-year-old maths conjecture that has eluded the world's greatest mathematicians has been cracked by an artificial intelligence model built by OpenAI. The result has stunned experts and is being hailed as a seismic moment for AI's mathematical ability. "This is a problem that I didn't expect to see solved in my lifetime," says Misha Rudnev at the University of Bristol, UK. "It's absolutely a bomb." Tim Gowers at the University of Cambridge wrote that the solution is "a milestone in AI mathematics" in a blog post accompanying the work . "If a human had written the paper and submitted it to the and I had been asked for a quick opinion, I would have recommended acceptance without any hesitation. No previous AI-generated proof has come close to that."
SURF: Steering the Scalarization Weight to Uniformly Traverse the Pareto Front
Jiang, Liuyuan, Huang, Chentong, Chen, Lisha
Scalarization is widely used in multi-objective optimization owing to its simplicity and scalability. In many applications, the goal is to generate solutions that represent diverse user preferences, ideally with uniform coverage of the Pareto front (PF). However, uniformly sampling scalarization weights usually induces non-uniform coverage of the PF. We explain this mismatch through a geometric analysis of the scalarization path. As the scalarization weight varies, the corresponding solutions trace the PF with a generally non-uniform traversal speed. This speed induces an arc-length cumulative distribution function (CDF); inverting this CDF map yields a principled rule for selecting weights that produce uniform PF coverage. Building on this insight, we propose SURF (Sampling Uniformly along the PaReto Front). For structured problems, including bi-objective bandits, we derive closed-form expressions for this CDF map and the resulting PF-aware weight sampling rule. For general problems, SURF alternates between CDF reconstruction and weight sampling. Theoretically, we show that under provable conditions, SURF converges linearly to an unavoidable finite-sampling floor. Empirically, experiments on bandits, multi-objective-gymnasium, and multi-objective LLM alignment demonstrate that SURF efficiently achieves more uniform PF coverage than baselines.
Conditioning Gaussian Processes on Almost Anything
Moss, Henry, Astfalck, Lachlan, Cowperthwaite, Thomas, Doumont, Colin, Willis, Sam, Hennig, Philipp, Nemeth, Christopher, Zammit-Mangion, Andrew
Gaussian processes (GPs) offer a principled probabilistic model over functions, but exact inference is restricted to the linear-Gaussian regime. We establish an explicit equivalence between GPs and a class of linear diffusion models, recasting predictive sampling as an ODE with closed-form Gaussian dynamics and a likelihood-dependent guidance term that admits a simple Monte Carlo approximation. In the linear-Gaussian setting, we recover standard GP conditioning exactly; beyond conjugacy, the same machinery handles any conditioning statement admitting point-wise likelihood evaluation -- including non-linear physics, and, for the first time, natural language via large language models. Whitening isolates the irreducible non-Gaussian dynamics, minimising Wasserstein-2 transport cost and eliminating numerical stiffness. The result is a general-purpose GP inference scheme requiring no bespoke derivations. Together, these results provide a general mechanism for incorporating the full richness of real-world knowledge as conditioning information, opening a new frontier for the probabilistic modelling of real-world problems.
StAD: Stein Amortized Divergence for Fast Likelihoods with Diffusion and Flow
Jagwani, Gurjeet, Thorp, Stephen, Deger, Sinan, Peiris, Hiranya
Diffusion and flow-based models are ubiquitously used for generative modelling and density estimation. They admit a deterministic probability flow ordinary differential equation (PF-ODE), analogous to continuous normalizing flows (CNFs), which describes the transport of the probability mass. Obtaining the likelihood from these models is of interest to many workflows, especially Bayesian analysis, and requires solving the trace of the Jacobian to compute the divergence of the learned PF-ODE, which is either $\mathcal{O}(D^2)$ to compute exactly or $\mathcal{O}(D)$ with a noisy estimate. We introduce StAD, a new distillation method to predict and learn the divergence of the PF-ODE using the Langevin-Stein operator without ever computing the Jacobian. We show that our method is competitive with the Hutchinson and Hutch++ on CIFAR-10, ImageNet and other density estimation tasks, consistently improving the variance and speed of the likelihood predictions compared to the Hutchinson. We additionally show our method will generalize to a varied class of generative models, and show that under some regularity conditions these learned vector fields can be made to satisfy the Stein class.
Satellites and AI used to track UK hedgehogs in bid to slow decline
Researchers at the University of Cambridge are using satellite data and AI in an effort to slow the decline in Britain's hedgehog population. Using an AI tool called Tessera, which analyses detailed images of the UK gathered from space, experts can precisely determine locations of hedgehog habitats - and where these are disappearing. The resulting maps capture landscapes in minute detail, including down to individual hedgerows, while AI can accurately predict hedgehog-friendly places obscured by cloud cover. Those behind the project hope it will help to shed light not just on where hedgehogs live across the UK, but barriers preventing them from finding food and mates. The researchers say Tessera's outputs can be used to track the impact of new housing developments and other environmental changes on landscapes that could affect hedgehogs over time.
Multifidelity Gaussian process regression for solving nonlinear partial differential equations
El-Boukkouri, Fatima-Zahrae, Garnier, Josselin, Roustant, Olivier
Solving nonlinear partial differential equations (PDEs) using kernel methods offers a compelling alternative to traditional numerical solvers. However, the performance of these methods strongly depends on the choice of kernel. In this work, as the available information is inherently multifidelity, we propose a kernel learning approach based on cokriging, leveraging empirical information from multifidelity simulations. In the first step, we fit a differentiable non-stationary kernel to an empirical kernel obtained from low-fidelity simulations. In the second step, we derive a high-fidelity kernel with estimated hyperparameters, and construct a corresponding high-fidelity mean using the multifidelity framework. These components can then be used within a Gaussian process framework for solving PDEs. Finally, we demonstrate the performance of the proposed physics-informed method on the Burgers' equation.