Goto

Collaborating Authors

 Europe


A Danish Couple's Maverick African Research Finds Its Moment in RFK Jr.'s Vaccine Policy

WIRED

The work of Peter Aaby and Christine Stabell Benn has long been controversial. Until Robert F. Kennedy Jr. became US health policy chief, most vaccine scientists tended to ignore it. In 1996, Guinea-Bissau seemed like an ideal research post for budding pediatrician Lone Graff Stensballe. Her supervisor, a fellow Dane named Peter Aaby, had spent nearly two decades collecting data on 100,000 people living in the mud brick homes of the West African country's capital. Aaby and his partner, Christine Stabell Benn, believed that the years of research in the impoverished country had yielded a major discovery about vaccines--and what they described as "non-specific effects": The measles and tuberculosis vaccines, which were derived from live, weakened viruses and bacteria, they said, boosted child survival beyond protecting against those particular pathogens. But, the scientists said, shots made from deactivated whole germs, or pieces of them, such as the diphtheria-tetanus-pertussis (DTP) shot, caused more deaths--especially in little girls--than getting no vaccine at all.


Escaped tiger shot by German police after attacking man

BBC News

An escaped tiger believed to be owned by Germany's Tiger Queen has been shot dead by police after attacking one of its keepers, according to local media reports. Police say a 73-year-old man was seriously injured after being attacked on Sunday while he was inside the animal's enclosure, located in a privately-owned facility on the outskirts of the German city of Leipzig. The tiger escaped the enclosure and was found shortly after by armed police, who shot and killed the animal. The site of the enclosure is believed to be owned by controversial trainer and private owner Carmen Zander, who describes herself as Germany's Tiger Queen. The animal was one of eight big cats kept at the industrial site near the German town of Schkeuditz, according to local media.


Instant AI answers can trivialise human intelligence, warns Royal Observatory

BBC News

The rise of AI tools that instantly answer questions and complex problems could make humans less intelligent, the Royal Observatory Greenwich has warned. The Observatory, one of the UK's oldest purpose-built scientific institutions, is known for its contributions to astronomy. Paddy Rodgers, director of the Royal Museums Greenwich group which oversees it, said its rich history of research showed the power of human knowledge and curiosity - and the need to avoid complete dependence on AI. A reliance solely on instant answers risks losing the habits of questioning and evaluation that underpin knowledge, expertise and innovation, he said. Rodgers' remarks come amid an ongoing transformation of the Royal Observatory in a project called First Light. The project hopes to seize on the passion of all the astronomers over the last 350 years, and interpret that passion through science, Rodgers told the BBC.


Estimating the expected output of wide random MLPs more efficiently than sampling

arXiv.org Machine Learning

By far the most common way to estimate an expected loss in machine learning is to draw samples, compute the loss on each one, and take the empirical average. However, sampling is not necessarily optimal. Given an MLP at initialization, we show how to estimate its expected output over Gaussian inputs without running samples through the network at all. Instead, we produce approximate representations of the distributions of activations at each layer, leveraging tools such as cumulants and Hermite expansions. We show both theoretically and empirically that for sufficiently wide networks, our estimator achieves a target mean squared error using substantially fewer FLOPs than Monte Carlo sampling. We find moreover that our methods perform particularly well at estimating the probabilities of rare events, and additionally demonstrate how they can be used for model training. Together, these findings suggest a path to producing models with a greatly reduced probability of catastrophic tail risks.


MaxSketch: Robust Distinct Counting in Streams via Random Projections

arXiv.org Machine Learning

Estimating the number of distinct elements in a data stream is well understood when repeated elements are identical. In modern settings, however, observations are high-dimensional and noisy, so repeated instances of the same object are only approximately similar -- for example, different images of the same individual may vary significantly at the pixel level. Classical sketches such as HyperLogLog rely on consistent hash values for identical elements and break down in this regime. Recent work on robust distinct counting in general metric spaces achieves $\widetildeฮ˜(\sqrt{n})$ memory, which is tight in the worst case. We show that substantially improved memory guarantees are possible under geometric structure common in learned representations. We introduce MaxSketch, a simple max-linear sketch built from random Gaussian projections, and prove that it succeeds in estimating the number of distinct latent objects. Concretely, we show that under this assumption $m = \widetilde{O} (\log n / \varepsilon^2)$ random projections (and hence $\widetilde{O} (\log n/\varepsilon^2)$ memory) suffice to recover the true distinct count within a $(1+\varepsilon)$ factor. Experiments on image streams confirm that MaxSketch accurately estimates distinct counts and generalizes beyond the training regime. Our results bridge classical streaming algorithms and modern representation learning, showing how geometric structure can fundamentally reduce the complexity of distinct counting.


$ฮฑ$-TCAV: A Unified Framework for Testing with Concept Activation Vectors

arXiv.org Machine Learning

Concept Activation Vectors (CAVs) are a fundamental tool for concept-based explainability in deep learning, yet their practical utility is limited by statistical instability. We analyze the stochastic nature of CAVs and the Testing with CAVs (TCAV) method, deriving the distributions of major CAV classes including PatternCAV, FastCAV, and ridge regression-based CAVs. We then identify a fundamental flaw in the standard TCAV score: its reliance on a discontinuous indicator function induces non-decaying variance in critical regimes. To address this, we introduce $ฮฑ$-TCAV, a generalized framework that replaces the indicator with a parameterized smooth function, yielding a unified probabilistic formulation that subsumes both TCAV and Multi-TCAV. We characterize the induced distributions of sensitivity scores and different TCAV variants, showing that established state-of-the-art choices lack theoretical justification. We provide principled guidance on tuning the parameter in $ฮฑ$-TCAV -- either to imitate Multi-TCAV at substantially lower computational cost, or to obtain a calibrated Bayes-optimal probabilistic measure of a concept's influence. Finally, our analysis yields practical recommendations that challenge established routines: most notably, allocating the full sampling budget to a single CAV rather than splitting it across several.


Intrinsic Wasserstein Rates for Score-Based Generative Models on Smooth Manifolds

arXiv.org Machine Learning

Score-based generative models are trained in high-dimensional ambient spaces, yet many data distributions are supported on low-dimensional nonlinear structures. We prove that, for compact $d$-dimensional smooth manifolds $\mathcal{M} \subset [0,1]^D$ with $d > 2$ and $ฮฒ$-Hรถlder densities strictly positive on $\mathcal{M}$, a variance-preserving SGM estimator attains the intrinsic Wasserstein--1 sample exponent $\tilde{\mathcal{O}}(D^{\mathcal{O}_ฮฒ(d)}n^{-(ฮฒ+1)/(d+2ฮฒ)})$, up to logarithmic factors and explicit geometry and density factors. The full nonasymptotic bound explicitly isolates the finite-order geometry envelope, Hรถlder radius, density lower bound, ambient dependence, and finite-order correction terms. The analysis separates score approximation into a large-noise tangent-cell regime and a small-noise projection-centered, de-Gaussianized Laplace regime. The key technical ingredient is a ReLU implementation of nearest-projection coordinates via finite intrinsic anchors and Gauss--Newton iterations, rather than approximating the manifold projection as a black-box high-dimensional smooth map. Consequently, for families with polynomially controlled geometry and density lower bounds, the constructed score-network parameters have polynomial ambient dependence.


Testing properties of trees in graphical models with covariance queries

arXiv.org Machine Learning

We consider the problem of testing properties of graphs underlying high-dimensional graphical models. We adopt the model of covariance queries introduced by Lugosi, Truszkowski, Velona, and Zwiernik (2021). We study the case when the underlying graph is a tree. The main results of the paper show that, while reconstructing the entire tree may be costly, certain global structural properties can be tested efficiently. In particular, we design randomized tests for global structural properties that use a sub-quadratic number of queries. We develop testing procedures for several fundamental properties, including the number of leaves, the maximum degree, the typical distance, and the diameter of the tree. For each property, we obtain explicit query complexity bounds that depend on the target threshold and tolerance parameters.


Explainable AI Isn't Enough! Rethinking Algorithmic Contestability

arXiv.org Machine Learning

Machine learning systems increasingly make life-changing decisions about individuals, such as loan approvals, hiring, and cheating detection, raising a pressing question: how can individuals respond to negative decisions made by these opaque systems? While explainable artificial intelligence (XAI) has largely focused on algorithmic recourse -- helping individuals change their features to obtain a desired outcome -- the parallel problem of algorithmic contestability -- helping individuals review and correct erroneous algorithmic decisions -- has received far less attention, despite its central ethical and legal importance. We trace this neglect to the absence of clear formal definitions and a systematic operationalization of contestability as an algorithmic problem. To address it, we propose an operational definition of contestability as a natural complement to recourse: contestability starts from the presumption that a decision may be incorrect and focuses on identifying evidence to challenge and potentially overturn it, whereas recourse assumes the decision is valid and instead provides pathways for changing it. We show that standard XAI explanations, such as counterfactuals, LIME, or Anchors, even when combined with human intuitions about decision continuity or monotonicity, reveal only errors in the neighborhood of the individual, but provide insufficient grounds for overturning the decision at hand. Going thus beyond traditional XAI, we identify three types of evidence warranting reversal according to the decision maker's own ethical standards: predictive multiplicity, incorrect feature values, and neglected overruling evidence. We argue that these render decisions normatively indefensible and thus successfully contestable. Finally, we analyze how existing EU legislation connects to our framework and argue that individuals already hold some legal rights to these forms of evidence.


SAFE Quantum Machine Learning with Variational Quantum Classifiers

arXiv.org Machine Learning

We propose a variational quantum classifier operating on high dimensional deep representations via amplitude encoding, stabilized by a learnable classical pre encoding layer.By combining normalized amplitude embeddings with bounded quantum observables, the resulting model induces a structured and smooth hypothesis class with controlled sensitivity to input variations. Model reliability is assessed using SAFE-AI metrics derived from the Cramer von Mises divergence, enabling consistent evaluation across accuracy, robustness, and explainability dimensions. Empirical results show that the proposed quantum model provides competitive predictive performance compared with strong classical baselines while exhibiting a more balanced SAFE reliability profile, with improved robustness to noise and stability under structured feature removal. These findings suggest that variational quantum circuits offer a principled mechanism for stability oriented SAFE learning in safety critical settings.