Goto

Collaborating Authors

 Genre


Integrating Feature Correlation in Differential Privacy with Applications in DP-ERM

arXiv.org Machine Learning

Standard differential privacy imposes uniform privacy constraints across all features, overlooking the inherent distinction between sensitive and insensitive features in practice. In this paper, we introduce a relaxed definition of differential privacy that accounts for such heterogeneity, allowing certain features to be treated as insensitive even when correlated with sensitive ones. We propose a correlation-aware framework, $\textsf{CorrDP}$, which relaxes privacy for insensitive features while accounting for their correlations with sensitive features, with the correlations quantified using total variation distance. We design algorithms for differentially private empirical risk minimization (DP-ERM) under the $\textsf{CorrDP}$ framework, incorporating distance-dependent noise into gradients for improved theoretical utility guarantees. When the correlation distance is unknown, we estimate it from the dataset and show that it achieves a comparable privacy-utility guarantee. We perform experiments on synthetic and real-world datasets and show that $\textsf{CorrDP}$-based DP-ERM algorithms consistently outperform the standard DP framework in the presence of insensitive features.


Conditional Diffusion Sampling

arXiv.org Machine Learning

Sampling from unnormalized multimodal distributions with limited density evaluations remains a fundamental challenge in machine learning and natural sciences. Successful approaches construct a bridge between a tractable reference and the target distribution. Parallel Tempering (PT) serves as the gold standard, while recent diffusion-based approaches offer a continuous alternative at the cost of neural training. In this work, we introduce Conditional Diffusion Sampling (CDS), a framework that combines these two paradigms. To this end, we derive Conditional Interpolants, a class of stochastic processes whose transport dynamics are governed by an exact, closed-form stochastic differential equation (SDE), requiring no neural approximation. Although these dynamics require sampling from a non-trivial initialization distribution, we show both theoretically and empirically that the cost of this initialization diminishes for sufficiently short diffusion times. CDS leverages this by a two-stage procedure: (1) PT is used to efficiently sample the initial distribution, and then (2) samples are transported via the transport SDE. This combination couples the robust global exploration of PT with efficient local transport. Experiments suggest that CDS has the potential to achieve a superior trade-off between sample quality and density evaluation cost compared to state-of-the-art samplers.


Xbox is ditching Microsoft's Copilot AI

Engadget

Xbox is ditching Microsoft's Copilot AI Xbox is ditching Microsoft's Copilot AI Microsoft announced plans to start stripping Copilot out of select Windows apps in March after criticism of the company's mishandling of its operating system reached a fever pitch. As it turns out though, Windows isn't the only place where you'll see less Copilot: Xbox CEO Asha Sharma has announced that the AI assistant will also be removed from the gaming brand's mobile app and Xbox consoles. Under previous Xbox leadership, Copilot was introduced as a sort of in-game assistant that would be aware of what you're playing and able to offer contextual advice based on what's on your screen. Microsoft launched a beta version of the experience by adding Copilot to the Xbox mobile app in May 2025, but based on a GDC presentation the company gave in March, the plan was to also bring Copilot to Xbox consoles later this year. Those plans apparently don't align with where Xbox is headed, Sharma said in a post announcing new hires to the Xbox division.


New moth species named for Pope Leo

Popular Science

'Pyralis papaleonei' reflects his strong stance on environmental conservation. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. The moth appears to be endemic to the island of Crete. Breakthroughs, discoveries, and DIY tips sent six days a week. Pope Leo XIV receives gifts from visitors from all over the world every year, but a newly identified insect may be the first papal tribute of its kind.


Microsoft, Google, xAI give US access to AI models for security testing

Al Jazeera

Tech giants Microsoft, Google and xAI say they will allow the United States federal government access to their new artificial intelligence models for national security testing. The Center for AI Standards and Innovation (CAISI) at the Department of Commerce announced the agreement on Tuesday amid increasing concerns about the capabilities that Anthropic's newly unveiled Mythos model could give hackers. The agreement fulfils a pledge the administration of US President Donald Trump made in July to partner with technology companies to vet their AI models for "national security risks". Microsoft will work with US government scientists to test AI systems "in ways that probe unexpected behaviors", the company said in a statement. Together they will develop shared data sets and workflows for testing the company's models, the company said.


Robotically assembled building blocks could make construction more efficient and sustainable

Robohub

Robotically assembled building blocks could be a more environmentally friendly method for erecting large-scale structures than some existing construction techniques, according to a new study by MIT researchers. The team conducted a feasibility study to evaluate the efficiency of constructing a simple building using "voxels," which are modular 3D subunits that assemble into complex, durable structures. After studying the performance of multiple voxels, the researchers developed three new designs intended to streamline building construction. They also produced a robotic assembler and a user-friendly interface for generating voxel-based building layouts and feeding instructions to the robots. Their results indicate this voxel-based robotic assembly system could reduce embodied carbon -- all of the carbon emitted during the lifecycle of building materials -- by as much as 82 percent, compared with popular techniques like 3D concrete printing, precast modular concrete, and steel framing.


Prehistoric child's finger bone, bear tooth pendant, and more discovered in Spanish cave

Popular Science

Science Archaeology Prehistoric child's finger bone, bear tooth pendant, and more discovered in Spanish cave Nestled in the Pyrenees Mountains, the high-altitude cave may have been an ancient mining camp and burial ground. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. Cave 338 is located at 7,332 feet (2,235 meters) above sea level in the Nรบria Valley (Queralbs, Ripollรจs). Breakthroughs, discoveries, and DIY tips sent six days a week. Life at high altitudes is unforgiving.


Design, Cups, and Blankets. A Free-Energy-Principle-Based Approach to Product Design

arXiv.org Machine Learning

Classical design theory treats the type of an object as a given: the designer decides in advance that this will be a cup, then optimizes its parameters. This paper argues that object type is not a presupposition but an inference, something that can be determined from physical data and functional requirements jointly. We call this problem requirement-steered interface type inference and show that it is inexpressible within existing design frameworks. This paper makes two contributions that are jointly necessary and individually incomplete. The first is the problem itself, which classical design cannot pose because it presupposes the very thing our problem seeks to determine. The second is C-DMBD, a constrained extension of the Dynamic Markov Blanket Detection algorithm, which makes requirement-steered inference computationally tractable. Drawing on the free-energy principle and active inference, established frameworks in theoretical neuroscience and Bayesian mechanics, we model a product's surface as a Markov blanket: the minimal boundary through which all causal exchange between object and environment must pass. Different blanket structures correspond to different object types; different parameterizations of the same structure correspond to different functional modes of the same type. This paper is a proof of concept and a theoretical proposal. It reframes design as inference rather than optimization, and as a relation between generative models rather than a specification of parameters.


A Continuous-Time Ensemble Kalman-Bucy Smoother for Causal Inference and Model Discovery

arXiv.org Machine Learning

Data assimilation (DA) integrates observational information with model predictions to improve state estimation in complex systems. While filtering provides the basis for online forecasts by using only past and present observations, it can exhibit delays and biases when the underlying dynamics evolve rapidly or undergo regime transitions. Smoothing, which additionally incorporates future observations, provides a natural pipeline for hindcasting and reanalysis that yields an uncertainty reduction beyond the filter. This paper introduces an ensemble Kalman-Bucy smoother (EnKBS) for continuous-time DA of nonlinear dynamical systems, where the smoother's conditional distributions are reconstructed using ensemble moments. The result is a derivative-free framework that does not require explicit computation of tangent-linear or adjoint models, which converges to the exact smoother solution at the infinite-ensemble limit for a wide class of complex systems. Incorporating standard regularization techniques for high-dimensional systems, such as covariance localization and inflation, the skill of the EnKBS is demonstrated in various important scientific problems. By integrating future observations, which reveal the underlying causal mechanisms for retrospective state updates, the EnKBS is used for Bayesian-based inference of causal relationships and their temporal influence range in a dyadic trigger-feedback model and the development of a causality-driven iterative learning algorithm that identifies the structure and recovers the hidden parameters of a nonlinear reduced-order model mimicking midlatitude atmospheric circulation. Notably, both tasks remain effective with an ensemble size of $O(10)$ under partial observations, suggesting that EnKBS can support the instantaneous discovery of high-dimensional complex systems over time.


An Efficient Spatial Branch-and-Bound Algorithm for Global Optimization of Gaussian Process Posterior Mean Functions

arXiv.org Machine Learning

We study the deterministic global optimization of trained Gaussian process posterior mean functions over hyperrectangular domains. Although the posterior mean function has a compact closed-form representation, its global optimization is challenging because it remains nonlinear and nonconvex. Existing exact deterministic approaches become increasingly difficult to scale as the number of training data points grows, leading to approximation-based methods that improve tractability by optimizing a modified (inexact) objective. In this work, we propose PALM-Mean, a piecewise-analytic lower-bounding framework embedded in reduced-space spatial branch-and-bound. At each node, kernel terms that are locally important are replaced by a sign-aware piecewise-linear relaxation in an appropriate scalar distance variable, while the remaining terms are bounded analytically in closed form. We show this hybrid approach yields a valid lower bound for the posterior mean, while limiting the size of the branch-and-bound subproblems. We establish validity of the node lower bounds and $\varepsilon$-global convergence of the resulting algorithm. Computational results on synthetic benchmarks and real-world application problems show that PALM-Mean improves scalability relative to representative general-purpose deterministic global solvers, particularly as the number of training data points increases.