Goto

Collaborating Authors

 misidentification


How ICE is using facial recognition in Minnesota

The Guardian

A border patrol agent scans the face of a driver in Minneapolis on 13 January 2026. A border patrol agent scans the face of a driver in Minneapolis on 13 January 2026. Immigration enforcement agents across the US are increasingly relying on a new smartphone app with facial recognition technology. The app is named Mobile Fortify. Simply pointing a phone's camera at their intended target and scanning the person's face allows Mobile Fortify to pull data on an individual from multiple federal and state databases, some of which federal courts have deemed too inaccurate for arrest warrants.


Minimax and Bayes Optimal Best-arm Identification: Adaptive Experimental Design for Treatment Choice

arXiv.org Machine Learning

This study investigates adaptive experimental design for treatment choice, also known as fixed-budget best-arm identification. We consider an adaptive procedure consisting of a treatment-allocation phase followed by a treatment-choice phase, and we design an adaptive experiment for this setup to efficiently identify the best treatment arm, defined as the one with the highest expected outcome. In our designed experiment, the treatment-allocation phase consists of two stages. The first stage is a pilot phase, where we allocate each treatment arm uniformly with equal proportions to eliminate clearly suboptimal arms and estimate outcome variances. In the second stage, we allocate treatment arms in proportion to the variances estimated in the first stage. After the treatment-allocation phase, the procedure enters the treatment-choice phase, where we choose the treatment arm with the highest sample mean as our estimate of the best treatment arm. We prove that this single design is simultaneously asymptotically minimax and Bayes optimal for the simple regret, with upper bounds that match our lower bounds up to exact constants. Therefore, our designed experiment achieves the sharp efficiency limits without requiring separate tuning for minimax and Bayesian objectives.


Adaptive Generalized Neyman Allocation: Local Asymptotic Minimax Optimal Best Arm Identification

arXiv.org Machine Learning

This study investigates a local asymptotic minimax optimal strategy for fixed-budget best arm identification (BAI). We propose the Adaptive Generalized Neyman Allocation (AGNA) strategy and show that its worst-case upper bound of the probability of misidentifying the best arm aligns with the worst-case lower bound under the small-gap regime, where the gap between the expected outcomes of the best and suboptimal arms is small. Our strategy corresponds to a generalization of the Neyman allocation for two-armed bandits (Neyman, 1934; Kaufmann et al., 2016) and a refinement of existing strategies such as the ones proposed by Glynn & Juneja (2004) and Shin et al. (2018). Compared to Komiyama et al. (2022), which proposes a minimax rate-optimal strategy, our proposed strategy has a tighter upper bound that exactly matches the lower bound, including the constant terms, by restricting the class of distributions to the class of small-gap distributions. Our result contributes to the longstanding open issue about the existence of asymptotically optimal strategies in fixed-budget BAI, by presenting the local asymptotic minimax optimal strategy.


AI Alignment: A Comprehensive Survey

arXiv.org Artificial Intelligence

AI alignment aims to make AI systems behave in line with human intentions and values. As AI systems grow more capable, so do risks from misalignment. To provide a comprehensive and up-to-date overview of the alignment field, in this survey, we delve into the core concepts, methodology, and practice of alignment. First, we identify four principles as the key objectives of AI alignment: Robustness, Interpretability, Controllability, and Ethicality (RICE). Guided by these four principles, we outline the landscape of current alignment research and decompose them into two key components: forward alignment and backward alignment. The former aims to make AI systems aligned via alignment training, while the latter aims to gain evidence about the systems' alignment and govern them appropriately to avoid exacerbating misalignment risks. On forward alignment, we discuss techniques for learning from feedback and learning under distribution shift. On backward alignment, we discuss assurance techniques and governance practices. We also release and continually update the website (www.alignmentsurvey.com) which features tutorials, collections of papers, blog posts, and other resources.


Worst-Case Optimal Multi-Armed Gaussian Best Arm Identification with a Fixed Budget

arXiv.org Machine Learning

Experimental design is crucial in evidence-based decision-making with multiple treatment arms, such as online advertisements and medical treatments. This study investigates the problem of identifying the treatment arm with the highest expected outcome, referred to as the best treatment arm, while minimizing the probability of misidentification. This problem has been studied across numerous research fields, including best arm identification (BAI) and ordinal optimization. In our experiments, the number of treatment-allocation rounds is fixed. During each round, a decision-maker allocates a treatment arm to an experimental unit and observes a corresponding outcome, which follows a Gaussian distribution with variances that can differ among the treatment arms. At the end of the experiment, we recommend one of the treatment arms as an estimate of the best treatment arm based on the observations. To design an experiment, we first discuss the worst-case lower bound for the probability of misidentification through an information-theoretic approach. Then, under the assumption that the variances are known, we propose the Generalized-Neyman-Allocation (GNA)-empirical-best-arm (EBA) strategy, an extension of the Neyman allocation proposed by Neyman (1934). We show that the GNA-EBA strategy is asymptotically optimal in the sense that its probability of misidentification aligns with the lower bounds as the sample size increases indefinitely and the differences between the expected outcomes of the best and other suboptimal arms converge to a uniform value. We refer to such strategies as asymptotically worst-case optimal.


Asymptotically Optimal Fixed-Budget Best Arm Identification with Variance-Dependent Bounds

arXiv.org Artificial Intelligence

We investigate the problem of fixed-budget best arm identification (BAI) for minimizing expected simple regret. In an adaptive experiment, a decision maker draws one of multiple treatment arms based on past observations and observes the outcome of the drawn arm. After the experiment, the decision maker recommends the treatment arm with the highest expected outcome. We evaluate the decision based on the expected simple regret, which is the difference between the expected outcomes of the best arm and the recommended arm. Due to inherent uncertainty, we evaluate the regret using the minimax criterion. First, we derive asymptotic lower bounds for the worst-case expected simple regret, which are characterized by the variances of potential outcomes (leading factor). Based on the lower bounds, we propose the Two-Stage (TS)-Hirano-Imbens-Ridder (HIR) strategy, which utilizes the HIR estimator (Hirano et al., 2003) in recommending the best arm. Our theoretical analysis shows that the TS-HIR strategy is asymptotically minimax optimal, meaning that the leading factor of its worst-case expected simple regret matches our derived worst-case lower bound. Additionally, we consider extensions of our method, such as the asymptotic optimality for the probability of misidentification. Finally, we validate the proposed method's effectiveness through simulations.


Bayesian Fixed-Budget Best-Arm Identification

arXiv.org Artificial Intelligence

Fixed-budget best-arm identification (BAI) is a bandit problem where the agent maximizes the probability of identifying the optimal arm within a fixed budget of observations. In this work, we study this problem in the Bayesian setting. We propose a Bayesian elimination algorithm and derive an upper bound on its probability of misidentifying the optimal arm. The bound reflects the quality of the prior and is the first distribution-dependent bound in this setting. We prove it using a frequentist-like argument, where we carry the prior through, and then integrate out the bandit instance at the end. We also provide a lower bound on the probability of misidentification in a $2$-armed Bayesian bandit and show that our upper bound (almost) matches it for any budget. Our experiments show that Bayesian elimination is superior to frequentist methods and competitive with the state-of-the-art Bayesian algorithms that have no guarantees in our setting.


UN fails to agree on 'killer robot' ban as nations pour billions into autonomous weapons research

Robohub

Humanitarian groups have been calling for a ban on autonomous weapons. Autonomous weapon systems – commonly known as killer robots – may have killed human beings for the first time ever last year, according to a recent United Nations Security Council report on the Libyan civil war. History could well identify this as the starting point of the next major arms race, one that has the potential to be humanity's final one. The United Nations Convention on Certain Conventional Weapons debated the question of banning autonomous weapons at its once-every-five-years review meeting in Geneva Dec. 13-17, 2021, but didn't reach consensus on a ban. Established in 1983, the convention has been updated regularly to restrict some of the world's cruelest conventional weapons, including land mines, booby traps and incendiary weapons.


Optimal Fixed-Budget Best Arm Identification using the Augmented Inverse Probability Estimator in Two-Armed Gaussian Bandits with Unknown Variances

arXiv.org Machine Learning

We consider the fixed-budget best arm identification problem in two-armed Gaussian bandits with unknown variances. The tightest lower bound on the complexity and an algorithm whose performance guarantee matches the lower bound have long been open problems when the variances are unknown and when the algorithm is agnostic to the optimal proportion of the arm draws. In this paper, we propose a strategy comprising a sampling rule with randomized sampling (RS) following the estimated target allocation probabilities of arm draws and a recommendation rule using the augmented inverse probability weighting (AIPW) estimator, which is often used in the causal inference literature. We refer to our strategy as the RS-AIPW strategy. In the theoretical analysis, we first derive a large deviation principle for martingales, which can be used when the second moment converges in mean, and apply it to our proposed strategy. Then, we show that the proposed strategy is asymptotically optimal in the sense that the probability of misidentification achieves the lower bound by Kaufmann et al. (2016) when the sample size becomes infinitely large and the gap between the two arms goes to zero.


Hidden Pentagon records reveal patterns of failure in deadly U.S. airstrikes

The Japan Times

Shortly before 3 a.m. on July 19, 2016, U.S. Special Operations forces bombed what they believed were three Islamic State (IS) group "staging areas" on the outskirts of Tokhar, a riverside hamlet in northern Syria. They reported 85 fighters killed. In fact, they hit houses far from the front line, where farmers, their families and other local people sought nighttime sanctuary from bombing and gunfire. More than 120 villagers were killed. In early 2017 in Iraq, an American war plane struck a dark-colored vehicle, believed to be a car bomb, stopped at an intersection in the Wadi Hajar neighborhood of West Mosul. Actually, the car had been bearing not a bomb but a man named Majid Mahmoud Ahmed, his wife and their two children, who were fleeing the fighting nearby. They and three other civilians were killed. In November 2015, after observing a man dragging an "unknown heavy object" into an IS "defensive fighting position," U.S. forces struck a building in Ramadi, Iraq. A military review found that the object was actually "a person of small stature" -- a child -- who died in the strike. None of these deadly failures resulted in a finding of wrongdoing. These cases are drawn from a hidden Pentagon archive of the American air war in the Middle East since 2014. The trove of documents -- the military's own confidential assessments of more than 1,300 reports of civilian casualties, obtained by The New York Times -- lays bare how the air war has been marked by deeply flawed intelligence, rushed and often imprecise targeting and the deaths of thousands of civilians, many of them children, a sharp contrast to the U.S. government's image of war waged by all-seeing drones and precision bombs. The documents show, too, that despite the Pentagon's highly codified system for examining civilian casualties, pledges of transparency and accountability have given way to opacity and impunity. In only a handful of cases were the assessments made public. Not a single record provided includes a finding of wrongdoing or disciplinary action. Fewer than a dozen condolence payments were made, even though many survivors were left with disabilities requiring expensive medical care. Documented efforts to identify root causes or lessons learned are rare. The air campaign represents a fundamental transformation of warfare that took shape in the final years of the Obama administration, amid the deepening unpopularity of the forever wars that had claimed more than 6,000 American service members. The United States traded many of its boots on the ground for an arsenal of aircraft directed by controllers sitting at computers, often thousands of kilometers away. President Barack Obama called it "the most precise air campaign in history." This was the promise: America's "extraordinary technology" would allow the military to kill the right people while taking the greatest possible care not to harm the wrong ones. The IS caliphate ultimately crumbled under the weight of American bombing.