Goto

Collaborating Authors

 nuisance


Fight between Waymo and Santa Monica goes to court

Los Angeles Times

Things to Do in L.A. Tap to enable a layout that focuses on the article. Self-driving vehicles charge at the Waymo station in Santa Monica. This is read by an automated voice. Please report any issues or inconsistencies here . Waymo is taking the city of Santa Monica to court after the city ordered the company to cease charging its autonomous vehicles at two facilities overnight, claiming the lights and beeping at the lots were a nuisance to residents.



Mind the Nuisance: Gaussian Process Classification using Privileged Noise

Neural Information Processing Systems

The learning with privileged information setting has recently attracted a lot of attention within the machine learning community, as it allows the integration of additional knowledge into the training process of a classifier, even when this comes in the form of a data modality that is not available at test time. Here, we show that privileged information can naturally be treated as noise in the latent function of a Gaussian process classifier (GPC). That is, in contrast to the standard GPC setting, the latent function is not just a nuisance but a feature: it becomes a natural measure of confidence about the training data by modulating the slope of the GPC probit likelihood function. Extensive experiments on public datasets show that the proposed GPC method using privileged noise, called GPC+, improves over a standard GPC without privileged knowledge, and also over the current state-of-the-art SVM-based method, SVM+. Moreover, we show that advanced neural networks and deep learning methods can be compressed as privileged information.


Deep learning for exoplanet detection and characterization by direct imaging at high contrast

Bodrito, Théo, Flasseur, Olivier, Mairal, Julien, Ponce, Jean, Langlois, Maud, Lagrange, Anne-Marie

arXiv.org Artificial Intelligence

Exoplanet imaging is a major challenge in astrophysics due to the need for high angular resolution and high contrast. We present a multi-scale statistical model for the nuisance component corrupting multivariate image series at high contrast. Integrated into a learnable architecture, it leverages the physics of the problem and enables the fusion of multiple observations of the same star in a way that is optimal in terms of detection signal-to-noise ratio. Applied to data from the VLT/SPHERE instrument, the method significantly improves the detection sensitivity and the accuracy of astrometric and photometric estimation.


DoubleGen: Debiased Generative Modeling of Counterfactuals

Luedtke, Alex, Fukumizu, Kenji

arXiv.org Machine Learning

Generative models for counterfactual outcomes face two key sources of bias. Confounding bias arises when approaches fail to account for systematic differences between those who receive the intervention and those who do not. Misspecification bias arises when methods attempt to address confounding through estimation of an auxiliary model, but specify it incorrectly. We introduce DoubleGen, a doubly robust framework that modifies generative modeling training objectives to mitigate these biases. The new objectives rely on two auxiliaries -- a propensity and outcome model -- and successfully address confounding bias even if only one of them is correct. We provide finite-sample guarantees for this robustness property. We further establish conditions under which DoubleGen achieves oracle optimality -- matching the convergence rates standard approaches would enjoy if interventional data were available -- and minimax rate optimality. We illustrate DoubleGen with three examples: diffusion models, flow matching, and autoregressive language models.


Stochastic Gradients under Nuisances

Yu, Facheng, Mehta, Ronak, Luedtke, Alex, Harchaoui, Zaid

arXiv.org Machine Learning

Stochastic gradient optimization is the dominant learning paradigm for a variety of scenarios, from classical supervised learning to modern self-supervised learning. We consider stochastic gradient algorithms for learning problems whose objectives rely on unknown nuisance parameters, and establish non-asymptotic convergence guarantees. Our results show that, while the presence of a nuisance can alter the optimum and upset the optimization trajectory, the classical stochastic gradient algorithm may still converge under appropriate conditions, such as Neyman orthogonality. Moreover, even when Neyman orthogonality is not satisfied, we show that an algorithm variant with approximately orthogonalized updates (with an approximately orthogonalized gradient oracle) may achieve similar convergence rates. Examples from orthogonal statistical learning/double machine learning and causal inference are discussed.


Appendix - Double Machine Learning Density Estimation for Local Treatment Effects with Instruments A IV Settings and LTE

Neural Information Processing Systems

We show that the causal graph in Figure 1 captures the set of IV assumptions in Assumption A.1. We will show the first item. We will show the second. We will show the third. Now, we will prove this Lemma through the master result in Lemma S.1.


Mind the Nuisance: Gaussian Process Classification using Privileged Noise

Neural Information Processing Systems

The learning with privileged information setting has recently attracted a lot of attention within the machine learning community, as it allows the integration of additional knowledge into the training process of a classifier, even when this comes in the form of a data modality that is not available at test time. Here, we show that privileged information can naturally be treated as noise in the latent function of a Gaussian process classifier (GPC). That is, in contrast to the standard GPC setting, the latent function is not just a nuisance but a feature: it becomes a natural measure of confidence about the training data by modulating the slope of the GPC probit likelihood function. Extensive experiments on public datasets show that the proposed GPC method using privileged noise, called GPC, improves over a standard GPC without privileged knowledge, and also over the current state-of-the-art SVM-based method, SVM . Moreover, we show that advanced neural networks and deep learning methods can be compressed as privileged information.


B-Learner: Quasi-Oracle Bounds on Heterogeneous Causal Effects Under Hidden Confounding

Oprescu, Miruna, Dorn, Jacob, Ghoummaid, Marah, Jesson, Andrew, Kallus, Nathan, Shalit, Uri

arXiv.org Machine Learning

Estimating heterogeneous treatment effects from observational data is a crucial task across many fields, helping policy and decision-makers take better actions. There has been recent progress on robust and efficient methods for estimating the conditional average treatment effect (CATE) function, but these methods often do not take into account the risk of hidden confounding, which could arbitrarily and unknowingly bias any causal estimate based on observational data. We propose a meta-learner called the B-Learner, which can efficiently learn sharp bounds on the CATE function under limits on the level of hidden confounding. We derive the B-Learner by adapting recent results for sharp and valid bounds of the average treatment effect (Dorn et al., 2021) into the framework given by Kallus & Oprescu (2023) for robust and model-agnostic learning of conditional distributional treatment effects. The B-Learner can use any function estimator such as random forests and deep neural networks, and we prove its estimates are valid, sharp, efficient, and have a quasi-oracle property with respect to the constituent estimators under more general conditions than existing methods. Semi-synthetic experimental comparisons validate the theoretical findings, and we use real-world data to demonstrate how the method might be used in practice.


Michigan Supreme Court to hear dispute over legality of using drone to take pictures of salvage yard

FOX News

Fox News Flash top headlines are here. Check out what's clicking on Foxnews.com. The Michigan Supreme Court will hear a dispute over the legality of using a drone to take pictures of a salvage yard near Traverse City. Aerial photos were used as evidence in a lawsuit against Todd and Heather Maxon, who were accused of violating a zoning ordinance and creating a nuisance with cars and other salvaged material in Long Lake Township. The Maxons argue that aerial photos violated their constitutional right against unreasonable searches.