Goto

Collaborating Authors

 Asia


The video game where you play as JESUS: Open-world simulator lets you 'follow the path' of the Messiah as he is baptized, perform miracles, fights the Devil and gets CRUCIFIED

Daily Mail - Science & tech

Ritzy Bay Area town torn apart after teacher's daughter, 16, was behind wheel when four friends died in high-speed crash... then she posted a TikTok video that poured fuel on the flames Two CIA officers killed in Mexico when their car skidded off ravine and exploded after meeting about bust of'largest ever drug lab' Nancy Guthrie sheriff's appalling past revealed: Beat handcuffed suspect so badly he needed intensive care, used VILE language about woman and lied in sworn statement Trump confronts Xi as US forces seize Chinese ship carrying mysterious'gift' to Iran New'Hollywood dose' pill: A-listers hooked on'youth elixir' that dermatologists say is anti-ageing, shrinks pores, smooths wrinkles... and even banishes rosacea Days after we got engaged, the love of my life told me he'd killed a man and buried him in a bog. I reported him to police... but then I made this irreversible mistake Ark of the Covenant's final resting place pinpointed by archaeologists as fresh search begins Fury as murderer marries pen pal behind bars... as teenage victim's mom says: 'I'm serving a life sentence without my son' Insiders claim failed AI rollout could be to blame for Tim Cook's departure from Apple - as one says'the AI era requires a different kind of leadership' Life-threatening cantaloupe recall in four states upgraded to FDA's highest risk level... 'reasonable probability of death' AMANDA PLATELL: Why Sarah Ferguson - with the ghost of Princess Diana at her side - is ready to sensationally blow up the Royal Family. She knows ALL their secrets... Team USA Olympics star Noah Lyles slammed for'horrible' reaction to his wife's wedding dress reveal In honour of the Queen's (purple!) reign: Kate mirrors late monarch's colourful wardrobe and wears her pearl earrings and necklace US troops board second tanker as Iran is accused of breaking ceasefire'numerous times' How to lose weight when perimenopause sabotages your metabolism: I'm a trainer but when I hit 46, I piled on the pounds overnight. The new'posh' drug that's easier to order than Uber Eats - and why all my middle-class friends have ditched booze and cocaine for it: JANA HOCKING Autistic woman, 24, worked hard to build independent life for herself... now she's PARALYZED thanks to selfishness of stranger The video game where you play as JESUS: Open-world simulator lets you'follow the path' of the Messiah as he is baptized, perform miracles, fights the Devil and gets CRUCIFIED READ MORE: Rare marble artefact discovered in the'cradle of Christianity' While fans eagerly await the release of Grand Theft Auto 6, one Polish game studio has unveiled a rather unorthodox alternative. The video game, titled'I Am Jesus Christ' allows players to'walk in the footsteps of Jesus' in an immersive first-person retelling of the gospel.


The secret project to settle controversial maths proof with a computer

New Scientist

One of the most bitterly contested proofs in modern mathematics may be on the verge of being untangled. Two projects, both aiming to use a computer program to cast new light on the controversy, are now up and running - with one having operated in secret for more than two years already. The developments are a positive sign that the row might find a solution, say mathematicians. The saga began in 2012 when Shinichi Mochizuki at Kyoto University, Japan, claimed to have proved a famous idea called the ABC conjecture, posting a 500-page proof online. The conjecture is simple to state, concerning prime numbers involved in solutions to the equation a + b = c and how these numbers relate to each other.


The man who ruined mathematics

New Scientist

Gödel's seminal work directly contradicted one of the great minds of mathematics and limited the field forever Kurt Gödel, the man who ruined mathematics, was one of the most important thinkers of the 20th century. He was born in 1906, smack-bang in the middle of the greatest crisis that maths has ever known. Just a few decades later, he would help resolve this turmoil, but in doing so doom mathematicians to a smaller world than the one that came before. Mathematics, as an intellectual framework, is incredibly powerful. The entire point is taking one set of logical ideas and using them to build another, making maths the closest thing we have to a cognitive perpetual-motion machine - there is always a new mathematical idea lurking across the horizon, and we just need to assemble the steps to get there.


Computationally lightweight classifiers with frequentist bounds on predictions

arXiv.org Machine Learning

While both classical and neural network classifiers can achieve high accuracy, they fall short on offering uncertainty bounds on their predictions, making them unfit for safety-critical applications. Existing kernel-based classifiers that provide such bounds scale with $\mathcal O (n^{\sim3})$ in time, making them computationally intractable for large datasets. To address this, we propose a novel, computationally efficient classification algorithm based on the Nadaraya-Watson estimator, for whose estimates we derive frequentist uncertainty intervals. We evaluate our classifier on synthetically generated data and on electrocardiographic heartbeat signals from the MIT-BIH Arrhythmia database. We show that the method achieves competitive accuracy $>$\SI{96}{\percent} at $\mathcal O(n)$ and $\mathcal O(\log n)$ operations, while providing actionable uncertainty bounds. These bounds can, e.g., aid in flagging low-confidence predictions, making them suitable for real-time settings with resource constraints, such as diagnostic monitoring or implantable devices.


Massively Parallel Exact Inference for Hawkes Processes

arXiv.org Machine Learning

Multivariate Hawkes processes are a widely used class of self-exciting point processes, but maximum likelihood estimation naively scales as $O(N^2)$ in the number of events. The canonical linear exponential Hawkes process admits a faster $O(N)$ recurrence, but prior work evaluates this recurrence sequentially, without exploiting parallelization on modern GPUs. We show that the Hawkes process intensity can be expressed as a product of sparse transition matrices admitting a linear-time associative multiply, enabling computation via a parallel prefix scan. This yields a simple yet massively parallelizable algorithm for maximum likelihood estimation of linear exponential Hawkes processes. Our method reduces the computational complexity to approximately $O(N/P)$ with $P$ parallel processors, and naturally yields a batching scheme to maintain constant memory usage, avoiding GPU memory constraints. Importantly, it computes the exact likelihood without any additional assumptions or approximations, preserving the simplicity and interpretability of the model. We demonstrate orders-of-magnitude speedups on simulated and real datasets, scaling to thousands of nodes and tens of millions of events, substantially beyond scales reported in prior work. We provide an open-source PyTorch library implementing our optimizations.


Unified Precision-Guaranteed Stopping Rules for Contextual Learning

arXiv.org Machine Learning

Contextual learning seeks to learn a decision policy that maps an individual's characteristics to an action through data collection. In operations management, such data may come from various sources, and a central question is when data collection can stop while still guaranteeing that the learned policy is sufficiently accurate. We study this question under two precision criteria: a context-wise criterion and an aggregate policy-value criterion. We develop unified stopping rules for contextual learning with unknown sampling variances in both unstructured and structured linear settings. Our approach is based on generalized likelihood ratio (GLR) statistics for pairwise action comparisons. To calibrate the corresponding sequential boundaries, we derive new time-uniform deviation inequalities that directly control the self-normalized GLR evidence and thus avoid the conservativeness caused by decoupling mean and variance uncertainty. Under the Gaussian sampling model, we establish finite-sample precision guarantees for both criteria. Numerical experiments on synthetic instances and two case studies demonstrate that the proposed stopping rules achieve the target precision with substantially fewer samples than benchmark methods. The proposed framework provides a practical way to determine when enough information has been collected in personalized decision problems. It applies across multiple data-collection environments, including historical datasets, simulation models, and real systems, enabling practitioners to reduce unnecessary sampling while maintaining a desired level of decision quality.


Score Shocks: The Burgers Equation Structure of Diffusion Generative Models

arXiv.org Machine Learning

We analyze the score field of a diffusion generative model through a Burgers-type evolution law. For VE diffusion, the heat-evolved data density implies that the score obeys viscous Burgers in one dimension and the corresponding irrotational vector Burgers system in $\R^d$, giving a PDE view of \emph{speciation transitions} as the sharpening of inter-mode interfaces. For any binary decomposition of the noised density into two positive heat solutions, the score separates into a smooth background and a universal $\tanh$ interfacial term determined by the component log-ratio; near a regular binary mode boundary this yields a normal criterion for speciation. In symmetric binary Gaussian mixtures, the criterion recovers the critical diffusion time detected by the midpoint derivative of the score and agrees with the spectral criterion of Biroli, Bonnaire, de~Bortoli, and Mézard (2024). After subtracting the background drift, the inter-mode layer has a local Burgers $\tanh$ profile, which becomes global in the symmetric Gaussian case with width $σ_τ^2/a$. We also quantify exponential amplification of score errors across this layer, show that Burgers dynamics preserves irrotationality, and use a change of variables to reduce the VP-SDE to the VE case, yielding a closed-form VP speciation time. Gaussian-mixture formulas are verified to machine precision, and the local theorem is checked numerically on a quartic double-well.


Order-Optimal Sequential 1-Bit Mean Estimation in General Tail Regimes

arXiv.org Machine Learning

In this paper, we study the problem of mean estimation under strict 1-bit communication constraints. We propose a novel adaptive mean estimator based solely on randomized threshold queries, where each 1-bit outcome indicates whether a given sample exceeds a sequentially chosen threshold. Our estimator is $(ε, δ)$-PAC for any distribution with a bounded mean $μ\in [-λ, λ]$ and a bounded $k$-th central moment $\mathbb{E}[|X-μ|^k] \le σ^k$ for any fixed $k > 1$. Crucially, our sample complexity is order-optimal in all such tail regimes, i.e., for every such $k$ value. For $k \neq 2$, our estimator's sample complexity matches the unquantized minimax lower bounds plus an unavoidable $O(\log(λ/σ))$ localization cost. For the finite-variance case ($k=2$), our estimator's sample complexity has an extra multiplicative $O(\log(σ/ε))$ penalty, and we establish a novel information-theoretic lower bound showing that this penalty is a fundamental limit of 1-bit quantization. We also establish a significant adaptivity gap: for both threshold queries and more general interval queries, the sample complexity of any non-adaptive estimator must scale linearly with the search space parameter $λ/σ$, rendering it vastly less sample efficient than our adaptive approach. Finally, we present algorithmic variants that (i) handle an unknown sampling budget, (ii) adapt to an unknown scale parameter~$σ$ given (possibly loose) bounds, and (iii) require only two stages of adaptivity at the expense of more complicated general 1-bit queries.


From Ground Truth to Measurement: A Statistical Framework for Human Labeling

arXiv.org Machine Learning

Supervised machine learning assumes that labeled data provide accurate measurements of the concepts models are meant to learn. Yet in practice, human labeling introduces systematic variation arising from ambiguous items, divergent interpretations, and simple mistakes. Machine learning research commonly treats all disagreement as noise, which obscures these distinctions and limits our understanding of what models actually learn. This paper reframes annotation as a measurement process and introduces a statistical framework for decomposing labeling outcomes into interpretable sources of variation: instance difficulty, annotator bias, situational noise, and relational alignment. The framework extends classical measurement-error models to accommodate both shared and individualized notions of truth, reflecting traditional and human label variation interpretations of error, and provides a diagnostic for assessing which regime better characterizes a given task. Applying the proposed model to a multi-annotator natural language inference dataset, we find empirical evidence for all four theorized components and demonstrate the effectiveness of our approach. We conclude with implications for data-centric machine learning and outline how this approach can guide the development of a more systematic science of labeling.


Differentially Private Language Generation and Identification in the Limit

arXiv.org Machine Learning

We initiate the study of language generation in the limit, a model recently introduced by Kleinberg and Mullainathan [KM24], under the constraint of differential privacy. We consider the continual release model, where a generator must eventually output a stream of valid strings while protecting the privacy of the entire input sequence. Our first main result is that for countable collections of languages, privacy comes at no qualitative cost: we provide an $\varepsilon$-differentially-private algorithm that generates in the limit from any countable collection. This stands in contrast to many learning settings where privacy renders learnability impossible. However, privacy does impose a quantitative cost: there are finite collections of size $k$ for which uniform private generation requires $Ω(k/\varepsilon)$ samples, whereas just one sample suffices non-privately. We then turn to the harder problem of language identification in the limit. Here, we show that privacy creates fundamental barriers. We prove that no $\varepsilon$-DP algorithm can identify a collection containing two languages with an infinite intersection and a finite set difference, a condition far stronger than the classical non-private characterization of identification. Next, we turn to the stochastic setting where the sample strings are sampled i.i.d. from a distribution (instead of being generated by an adversary). Here, we show that private identification is possible if and only if the collection is identifiable in the adversarial model. Together, our results establish new dimensions along which generation and identification differ and, for identification, a separation between adversarial and stochastic settings induced by privacy constraints.