Goto

Collaborating Authors

 Oceania


Proof-of-age ID leaked in Discord data breach

The Guardian

The personal information of users of the video game chat platform has been compromised. The personal information of users of the video game chat platform has been compromised. Video game chat platform tells users that driver's licences and passports were among the forms of data accessed via a third-party customer service provider Video game chat platform Discord has suffered a data breach, informing users that their personal information - including identity documents of those required to prove their age - were compromised. The company stated last week that an unauthorised party had compromised one of Discord's third-party customer service providers, leading to the access of "a limited number of users" who had been in contact with the customer service or trust and safety teams. The data compromised may have included usernames, email, billing information, the last four digits of credit card numbers, IP addresses and messages with customer support.


Robin Williams' daughter Zelda hits out at AI-generated videos of her dead father: 'stop doing this to him'

The Guardian

Zelda has asked people to stop sending her AI videos of her father, who died in 2014 at the age of 63. Zelda has asked people to stop sending her AI videos of her father, who died in 2014 at the age of 63. Film-maker tells the public to stop sending her videos, saying: 'You're not making art, you're making disgusting, over-processed hotdogs out of the lives of human beings' Zelda Williams, the daughter of the late actor and comedian Robin Williams, has spoken out against AI-generated content featuring her father. "Please, just stop sending me AI videos of Dad," Zelda wrote in an Instagram story on Monday . "Stop believing I wanna see it or that I'll understand, I don't and I won't. If you're just trying to troll me, I've seen way worse, I'll restrict and move on. But please, if you've got any decency, just stop doing this to him and to me, to everyone even, full stop. It's dumb, it's a waste of time and energy, and believe me, it's NOT what he'd want. "To watch the legacies of real people be condensed down to'this vaguely looks and sounds like them so that's enough', just so other people can churn out horrible TikTok slop puppeteering them is maddening.


ResCP: Reservoir Conformal Prediction for Time Series Forecasting

arXiv.org Machine Learning

Conformal prediction offers a powerful framework for building distribution-free prediction intervals for exchangeable data. Existing methods that extend conformal prediction to sequential data rely on fitting a relatively complex model to capture temporal dependencies. However, these methods can fail if the sample size is small and often require expensive retraining when the underlying data distribution changes. To overcome these limitations, we propose Reservoir Conformal Prediction (ResCP), a novel training-free conformal prediction method for time series. Our approach leverages the efficiency and representation learning capabilities of reservoir computing to dynamically reweight conformity scores. In particular, we compute similarity scores among reservoir states and use them to adaptively reweight the observed residuals at each step. With this approach, ResCP enables us to account for local temporal dynamics when modeling the error distribution without compromising computational scalability. We prove that, under reasonable assumptions, ResCP achieves asymptotic conditional coverage, and we empirically demonstrate its effectiveness across diverse forecasting tasks.


Score-based generative emulation of impact-relevant Earth system model outputs

arXiv.org Machine Learning

Policy targets evolve faster than the Couple Model Intercomparison Project cycles, complicating adaptation and mitigation planning that must often contend with outdated projections. Climate model output emulators address this gap by offering inexpensive surrogates that can rapidly explore alternative futures while staying close to Earth System Model (ESM) behavior. We focus on emulators designed to provide inputs to impact models. Using monthly ESM fields of near-surface temperature, precipitation, relative humidity, and wind speed, we show that deep generative models have the potential to model jointly the distribution of variables relevant for impacts. The specific model we propose uses score-based diffusion on a spherical mesh and runs on a single mid-range graphical processing unit. We introduce a thorough suite of diagnostics to compare emulator outputs with their parent ESMs, including their probability densities, cross-variable correlations, time of emergence, or tail behavior. We evaluate performance across three distinct ESMs in both pre-industrial and forced regimes. The results show that the emulator produces distributions that closely match the ESM outputs and captures key forced responses. They also reveal important failure cases, notably for variables with a strong regime shift in the seasonal cycle. Although not a perfect match to the ESM, the inaccuracies of the emulator are small relative to the scale of internal variability in ESM projections. We therefore argue that it shows potential to be useful in supporting impact assessment. We discuss priorities for future development toward daily resolution, finer spatial scales, and bias-aware training. Code is made available at https://github.com/shahineb/climemu.


Self-Speculative Masked Diffusions

arXiv.org Machine Learning

We present self-speculative masked diffusions, a new class of masked diffusion generative models for discrete data that require significantly fewer function evaluations to generate samples. Standard masked diffusion models predict factorized logits over currently masked positions. A number of masked positions are then sampled, however, the factorization approximation means that sampling too many positions in one go leads to poor sample quality. As a result, many simulation steps and therefore neural network function evaluations are required to generate high-quality data. We reduce the computational burden by generating non-factorized predictions over masked positions. This is achieved by modifying the final transformer attention mask from non-causal to causal, enabling draft token generation and parallel validation via a novel, model-integrated speculative sampling mechanism. This results in a non-factorized predictive distribution over masked positions in a single forward pass. We apply our method to GPT2 scale text modelling and protein sequences generation, finding that we can achieve a ~2x reduction in the required number of network forward passes relative to standard masked diffusion models.


The analogy theorem in Hoare logic

arXiv.org Machine Learning

The introduction of machine learning methods has led to significant advances in automation, optimization, and discoveries in various fields of science and technology. However, their widespread application faces a fundamental limitation: the transfer of models between data domains generally lacks a rigorous mathematical justification. The key problem is the lack of formal criteria to guarantee that a model trained on one type of data will retain its properties on another.This paper proposes a solution to this problem by formalizing the concept of analogy between data sets and models using first-order logic and Hoare logic.We formulate and rigorously prove a theorem that sets out the necessary and sufficient conditions for analogy in the task of knowledge transfer between machine learning models. Practical verification of the analogy theorem on model data obtained using the Monte Carlo method, as well as on MNIST and USPS data, allows us to achieving F1 scores of 0.84 and 0.88 for convolutional neural networks and random forests, respectively.The proposed approach not only allows us to justify the correctness of transfer between domains but also provides tools for comparing the applicability of models to different types of data.The main contribution of the work is a rigorous formalization of analogy at the level of program logic, providing verifiable guarantees of the correctness of knowledge transfer, which opens new opportunities for both theoretical research and the practical use of machine learning models in previously inaccessible areas.


Automating construction safety inspections using a multi-modal vision-language RAG framework

arXiv.org Artificial Intelligence

Conventional construction safety inspection methods are often inefficient as they require navigating through large volume of information. Recent advances in large vision-language models (LVLMs) provide opportunities to automate safety inspections through enhanced visual and linguistic understanding. However, existing applications face limitations including irrelevant or unspecific responses, restricted modal inputs and hallucinations. Utilisation of Large Language Models (LLMs) for this purpose is constrained by availability of training data and frequently lack real-time adaptability. This study introduces SiteShield, a multi-modal LVLM-based Retrieval-Augmented Generation (RAG) framework for automating construction safety inspection reports by integrating visual and audio inputs. Using real-world data, SiteShield outperformed unimodal LLMs without RAG with an F1 score of 0.82, hamming loss of 0.04, precision of 0.76, and recall of 0.96. The findings indicate that SiteShield offers a novel pathway to enhance information retrieval and efficiency in generating safety reports.


Optimising Battery Energy Storage System Trading via Energy Market Operator Price Forecast

arXiv.org Artificial Intelligence

In electricity markets around the world, the ability to anticipate price movements with precision can be the difference between profit and loss, especially for fast-acting assets like battery energy storage systems (BESS). As grid volatility increases due to renewables and market decentralisation, operators and forecasters alike face growing pressure to transform prediction into strategy. Yet while forecast data is abundant, especially in advanced markets like Australia's National Electricity Market (NEM), its practical value in driving real-world BESS trading decisions remains largely unexplored. This thesis dives into that gap. This work addresses a key research question: Can the accuracy of the Australian Energy Market Operator (AEMO) energy price forecasts be systematically leveraged to develop a reliable and profitable battery energy storage system trading algorithm? Despite the availability of AEMO price forecasts, no existing framework evaluates their reliability or incorporates them into practical BESS trading strategies. By analysing patterns in forecast accuracy based on time of day, forecast horizon, and regional variations, this project creates a novel, forecast-informed BESS trading model to optimise arbitrage financial returns. The performance of this forecast-driven algorithm is benchmarked against a basic trading algorithm with no knowledge of forecast data. The study further explores the potential of machine learning techniques to predict future energy prices by enhancing AEMO forecasts to govern a more advanced trading strategy. The research outcomes will inform future improvements in energy market trading models and promote more efficient BESS integration into market operations.


New Supreme Court term will reshape Trump's powers

BBC News

New Supreme Court term will reshape Trump's powers The US Supreme Court begins its new term on Monday with a docket already full of potentially significant cases that could define the scope of Donald Trump's presidential authority - and the prospect of more to come. In the eight months that Trump has been back in the White House, he has tested the limits of executive power, unilaterally implementing new policies, slashing federal budgets and workforce, and attempting to bring previously independent agencies and institutions more directly under his control. The latest brewing legal battle comes from the president's attempts to take control of state National Guard units and deploy them in cities where he claims there is public unrest and rampant crime - over the objection of local and state officials. In Oregon, a federal judge has issued orders blocking Trump's deployment of troops to Portland. An appeals court is set to review the move in the coming days.


Newly discovered deep-sea lanternshark glows in the waters near Australia

Popular Science

The tiny shark and a ghost-like crab are two of the latest species uncovered in a yearslong expedition. Breakthroughs, discoveries, and DIY tips sent every weekday. Oceanographers scouring the waters off of Western Australia have discovered two new deep-sea oddities . On October 6, Australia's Commonwealth Scientific and Industrial Research Organization (CSIRO) showcased these new species originally collected in 2022: a bioluminescent lanternshark and a tiny, semi-translucent porcelain crab . The team revealed two of its initial finds--the painted hornshark and the ridged-egg catshark --in 2023.