Goto

Collaborating Authors

 safe screening



Safe Distributionally Robust Feature Selection under Covariate Shift

arXiv.org Machine Learning

In practical machine learning, the environments encountered during the model development and deployment phases often differ, especially when a model is used by many users in diverse settings. Learning models that maintain reliable performance across plausible deployment environments is known as distributionally robust (DR) learning. In this work, we study the problem of distributionally robust feature selection (DRFS), with a particular focus on sparse sensing applications motivated by industrial needs. In practical multi-sensor systems, a shared subset of sensors is typically selected prior to deployment based on performance evaluations using many available sensors. At deployment, individual users may further adapt or fine-tune models to their specific environments. When deployment environments differ from those anticipated during development, this strategy can result in systems lacking sensors required for optimal performance. To address this issue, we propose safe-DRFS, a novel approach that extends safe screening from conventional sparse modeling settings to a DR setting under covariate shift. Our method identifies a feature subset that encompasses all subsets that may become optimal across a specified range of input distribution shifts, with finite-sample theoretical guarantees of no false feature elimination.



Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. The authors propose a new screening rule Slores for pre-filtering variables for logistic regression. This statement though sounds too simple and doesn't give the paper justice at all. The paper provides a rigorous and theoretically well founded derivation of a novel pre-screening rule which could in principle be extended to other settings as well. The method is also efficient compared to other safe rules that guarantee to discard only non-zero entries.


Safe Screening for Unbalanced Optimal Transport

arXiv.org Artificial Intelligence

Optimal transport (OT), as a metric, has gained significant attention in the field of machine learning in recent years due to its remarkable ability to capture geometric relationships between data distributions. It has demonstrated impressive achievements in many fields [16, 3, 13, 31]. To overcome the limitation of OT in handling data with unequal quantities, researchers introduced unbalanced optimal transport (UOT) [8] by relaxing the constraints using penalty functions. UOT has been found extensive applications in computational biology [40], machine learning [25], and deep learning domains [51]. However, compared to traditional metrics, the computational burden associated with OT, including UOT, has impeded their widespread adoption on large-scale problems. The state-of-the-art linear programming algorithms suffer from cubic computational complexity and are challenging to parallelize on GPUs [46].


Efficient Model Selection for Predictive Pattern Mining Model by Safe Pattern Pruning

arXiv.org Artificial Intelligence

Predictive pattern mining is an approach used to construct prediction models when the input is represented by structured data, such as sets, graphs, and sequences. The main idea behind predictive pattern mining is to build a prediction model by considering substructures, such as subsets, subgraphs, and subsequences (referred to as patterns), present in the structured data as features of the model. The primary challenge in predictive pattern mining lies in the exponential growth of the number of patterns with the complexity of the structured data. In this study, we propose the Safe Pattern Pruning (SPP) method to address the explosion of pattern numbers in predictive pattern mining. We also discuss how it can be effectively employed throughout the entire model building process in practical data analysis. To demonstrate the effectiveness of the proposed method, we conduct numerical experiments on regression and classification problems involving sets, graphs, and sequences.


Beyond GAP screening for Lasso by exploiting new dual cutting half-spaces with supplementary material

arXiv.org Machine Learning

In this paper, we propose a novel safe screening test for Lasso. Our procedure is based on a safe region with a dome geometry and exploits a canonical representation of the set of half-spaces (referred to as "dual cutting half-spaces" in this paper) containing the dual feasible set. The proposed safe region is shown to be always included in the state-of-the-art "GAP Sphere" and "GAP Dome" proposed by Fercoq et al. (and strictly so under very mild conditions) while involving the same computational burden. Numerical experiments confirm that our new dome enables to devise more powerful screening tests than GAP regions and lead to significant acceleration to solve Lasso.


Accelerating Non-Negative and Bounded-Variable Linear Regression Algorithms with Safe Screening

arXiv.org Machine Learning

Non-negative and bounded-variable linear regression problems arise in a variety of applications in machine learning and signal processing. In this paper, we propose a technique to accelerate existing solvers for these problems by identifying saturated coordinates in the course of iterations. This is akin to safe screening techniques previously proposed for sparsity-regularized regression problems. The proposed strategy is provably safe as it provides theoretical guarantees that the identified coordinates are indeed saturated in the optimal solution. Experimental results on synthetic and real data show compelling accelerations for both non-negative and bounded-variable problems.


Safe Screening for Logistic Regression with $\ell_0$-$\ell_2$ Regularization

arXiv.org Machine Learning

In logistic regression, it is often desirable to utilize regularization to promote sparse solutions, particularly for problems with a large number of features compared to available labels. In this paper, we present screening rules that safely remove features from logistic regression with $\ell_0-\ell_2$ regularization before solving the problem. The proposed safe screening rules are based on lower bounds from the Fenchel dual of strong conic relaxations of the logistic regression problem. Numerical experiments with real and synthetic data suggest that a high percentage of the features can be effectively and safely removed apriori, leading to substantial speed-up in the computations.


Simultaneous Safe Screening of Features and Samples in Doubly Sparse Modeling

arXiv.org Machine Learning

The problem of learning a sparse model is conceptually interpreted as the process of identifying active features/samples and then optimizing the model over them. Recently introduced safe screening allows us to identify a part of non-active features/samples. So far, safe screening has been individually studied either for feature screening or for sample screening. In this paper, we introduce a new approach for safely screening features and samples simultaneously by alternatively iterating feature and sample screening steps. A significant advantage of considering them simultaneously rather than individually is that they have a synergy effect in the sense that the results of the previous safe feature screening can be exploited for improving the next safe sample screening performances, and vice-versa. We first theoretically investigate the synergy effect, and then illustrate the practical advantage through intensive numerical experiments for problems with large numbers of features and samples.