Goto

Collaborating Authors

 Law


SHAP-based Explanations are Sensitive to Feature Representation

arXiv.org Artificial Intelligence

Local feature-based explanations are a key component of the XAI toolkit. These explanations compute feature importance values relative to an ``interpretable'' feature representation. In tabular data, feature values themselves are often considered interpretable. This paper examines the impact of data engineering choices on local feature-based explanations. We demonstrate that simple, common data engineering techniques, such as representing age with a histogram or encoding race in a specific way, can manipulate feature importance as determined by popular methods like SHAP. Notably, the sensitivity of explanations to feature representation can be exploited by adversaries to obscure issues like discrimination. While the intuition behind these results is straightforward, their systematic exploration has been lacking. Previous work has focused on adversarial attacks on feature-based explainers by biasing data or manipulating models. To the best of our knowledge, this is the first study demonstrating that explainers can be misled by standard, seemingly innocuous data engineering techniques.


Mirror Mirror on the Wall, Have I Forgotten it All? A New Framework for Evaluating Machine Unlearning

arXiv.org Artificial Intelligence

Machine unlearning methods take a model trained on a dataset and a forget set, then attempt to produce a model as if it had only been trained on the examples not in the forget set. We empirically show that an adversary is able to distinguish between a mirror model (a control model produced by retraining without the data to forget) and a model produced by an unlearning method across representative unlearning methods from the literature. We build distinguishing algorithms based on evaluation scores in the literature (i.e. membership inference scores) and Kullback-Leibler divergence. We propose a strong formal definition for machine unlearning called computational unlearning. Computational unlearning is defined as the inability for an adversary to distinguish between a mirror model and a model produced by an unlearning method. If the adversary cannot guess better than random (except with negligible probability), then we say that an unlearning method achieves computational unlearning. Our computational unlearning definition provides theoretical structure to prove unlearning feasibility results. For example, our computational unlearning definition immediately implies that there are no deterministic computational unlearning methods for entropic learning algorithms. We also explore the relationship between differential privacy (DP)-based unlearning methods and computational unlearning, showing that DP-based approaches can satisfy computational unlearning at the cost of an extreme utility collapse. These results demonstrate that current methodology in the literature fundamentally falls short of achieving computational unlearning. We conclude by identifying several open questions for future work.


Border state law enforcement to shoot down 'weaponized' drug-smuggling drones

FOX News

Raul Gastesi speaks with Fox News Digital about a bill moving through the Florida Senate that would give homeowners the right to use "reasonable force" to take down drones infringing on their privacy rights. A newly-minted law allowing Arizona law enforcement officers to shoot down drug-carrying drones along the U.S.-Mexico border has taken effect after sailing through the state's legislature with bipartisan support. HB 2733 was signed into law on April 18 and grants officers the ability to target drones suspected of carrying out illegal activity within 15 miles of the state's international border. "Cartels are increasingly using drones to survey the border to locate [U.S. Customs and Border Protection] officers' locations and to transport illegal drugs from Mexico into our state," state Rep. David Marshall, the bill's sponsor, said in a statement to Fox News Digital. "Law enforcement tools at [our] disposal will be electronic jamming devices, as well as using shotguns with bird shot to bring down these drones."


ChatGPT Turned Into a Studio Ghibli Machine. How Is That Legal?

The Atlantic - Technology

A few weeks ago, OpenAI pulled off one of the greatest corporate promotions in recent memory. Whereas the initial launch of ChatGPT, back in 2022, was "one of the craziest viral moments i'd ever seen," CEO Sam Altman wrote on social media, the response to a new upgrade was, in his words, "biblical": 1 million users supposedly signed up to use the chatbot in just one hour, Altman reported, thanks to a new, more permissive image-generating capability that could imitate the styles of various art and design studios. Altman called it "a new high-water mark for us in allowing creative freedom." Almost immediately, images began to flood the internet. The most popular style, by a long shot, was that of Studio Ghibli, the Japanese animation studio co-founded by Hayao Miyazaki and widely beloved for films such as Spirited Away and Princess Mononoke.


Trump strikes a blow for AI – by firing the US copyright supremo

The Guardian

Sometimes it helps me to write by thinking about how a radio broadcaster or television presenter would deliver the information, so I'm your host, Blake Montgomery. Today in tech news: questions hover over the automation of labor in the worker-strapped US healthcare system; and drones proliferate in a new conflict: India v Pakistan, both armed with nuclear weapons. Meanwhile, in contrast to a thoughtful and robust conversation, the US is taking the opposite tack. Legend has it that Alexander the Great was presented with a knot in a rope tying a cart to a stake. So complex were its twistings that no man had been able to untie it of the hundreds who had tried. Alexander silently drew his sword and sliced the knot in two.


Police tech can sidestep facial recognition bans now

MIT Technology Review

Companies like Flock and Axon sell suites of sensors--cameras, license plate readers, gunshot detectors, drones--and then offer AI tools to make sense of that ocean of data (at last year's conference I saw schmoozing between countless AI-for-police startups and the chiefs they sell to on the expo floor). Departments say these technologies save time, ease officer shortages, and help cut down on response times. Those sound like fine goals, but this pace of adoption raises an obvious question: Who makes the rules here? When does the use of AI cross over from efficiency into surveillance, and what type of transparency is owed to the public? In some cases, AI-powered police tech is already driving a wedge between departments and the communities they serve.


Towards Artificial General or Personalized Intelligence? A Survey on Foundation Models for Personalized Federated Intelligence

arXiv.org Artificial Intelligence

The rise of large language models (LLMs), such as ChatGPT, DeepSeek, and Grok-3, has reshaped the artificial intelligence landscape. As prominent examples of foundational models (FMs) built on LLMs, these models exhibit remarkable capabilities in generating human-like content, bringing us closer to achieving artificial general intelligence (AGI). However, their large-scale nature, sensitivity to privacy concerns, and substantial computational demands present significant challenges to personalized customization for end users. To bridge this gap, this paper presents the vision of artificial personalized intelligence (API), focusing on adapting these powerful models to meet the specific needs and preferences of users while maintaining privacy and efficiency. Specifically, this paper proposes personalized federated intelligence (PFI), which integrates the privacy-preserving advantages of federated learning (FL) with the zero-shot generalization capabilities of FMs, enabling personalized, efficient, and privacy-protective deployment at the edge. We first review recent advances in both FL and FMs, and discuss the potential of leveraging FMs to enhance federated systems. We then present the key motivations behind realizing PFI and explore promising opportunities in this space, including efficient PFI, trustworthy PFI, and PFI empowered by retrieval-augmented generation (RAG). Finally, we outline key challenges and future research directions for deploying FM-powered FL systems at the edge with improved personalization, computational efficiency, and privacy guarantees. Overall, this survey aims to lay the groundwork for the development of API as a complement to AGI, with a particular focus on PFI as a key enabling technique.


Efficient Machine Unlearning by Model Splitting and Core Sample Selection

arXiv.org Machine Learning

Machine unlearning is essential for meeting legal obligations such as the right to be forgotten, which requires the removal of specific data from machine learning models upon request. While several approaches to unlearning have been proposed, existing solutions often struggle with efficiency and, more critically, with the verification of unlearning - particularly in the case of weak unlearning guarantees, where verification remains an open challenge. We introduce a generalized variant of the standard unlearning metric that enables more efficient and precise unlearning strategies. We also present an unlearning-aware training procedure that, in many cases, allows for exact unlearning. We term our approach MaxRR. When exact unlearning is not feasible, MaxRR still supports efficient unlearning with properties closely matching those achieved through full retraining.


Causal mediation analysis with one or multiple mediators: a comparative study

arXiv.org Machine Learning

Mediation analysis breaks down the causal effect of a treatment on an outcome into an indirect effect, acting through a third group of variables called mediators, and a direct effect, operating through other mechanisms. Mediation analysis is hard because confounders between treatment, mediators, and outcome blur effect estimates in observational studies. Many estimators have been proposed to adjust on those confounders and provide accurate causal estimates. We consider parametric and non-parametric implementations of classical estimators and provide a thorough evaluation for the estimation of the direct and indirect effects in the context of causal mediation analysis for binary, continuous, and multi-dimensional mediators. We assess several approaches in a comprehensive benchmark on simulated data. Our results show that advanced statistical approaches such as the multiply robust and the double machine learning estimators achieve good performances in most of the simulated settings and on real data. As an example of application, we propose a thorough analysis of factors known to influence cognitive functions to assess if the mechanism involves modifications in brain morphology using the UK Biobank brain imaging cohort. This analysis shows that for several physiological factors, such as hypertension and obesity, a substantial part of the effect is mediated by changes in the brain structure. This work provides guidance to the practitioner from the formulation of a valid causal mediation problem, including the verification of the identification assumptions, to the choice of an adequate estimator.


MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

arXiv.org Artificial Intelligence

We introduce MLE-Dojo, a Gym-style framework for systematically reinforcement learning, evaluating, and improving autonomous large language model (LLM) agents in iterative machine learning engineering (MLE) workflows. Unlike existing benchmarks that primarily rely on static datasets or single-attempt evaluations, MLE-Dojo provides an interactive environment enabling agents to iteratively experiment, debug, and refine solutions through structured feedback loops. Built upon 200+ real-world Kaggle challenges, MLE-Dojo covers diverse, open-ended MLE tasks carefully curated to reflect realistic engineering scenarios such as data processing, architecture search, hyperparameter tuning, and code debugging. Its fully executable environment supports comprehensive agent training via both supervised fine-tuning and reinforcement learning, facilitating iterative experimentation, realistic data sampling, and real-time outcome verification. Extensive evaluations of eight frontier LLMs reveal that while current models achieve meaningful iterative improvements, they still exhibit significant limitations in autonomously generating long-horizon solutions and efficiently resolving complex errors. Furthermore, MLE-Dojo's flexible and extensible architecture seamlessly integrates diverse data sources, tools, and evaluation protocols, uniquely enabling model-based agent tuning and promoting interoperability, scalability, and reproducibility. We open-source our framework and benchmarks to foster community-driven innovation towards next-generation MLE agents.