Identifying The Most Informative Features Using A Structurally Interacting Elastic Net
Cui, Lixin, Bai, Lu, Zhang, Zhihong, Wang, Yue, Hancock, Edwin R.
Feature selection can efficiently identify the most informative features with respect to the target feature used in training. However, state-of-the-art vector-based methods are unable to encapsulate the relationships between feature samples into the feature selection process, thus leading to significant information loss. To address this problem, we propose a new graph-based structurally interacting elastic net method for feature selection. Specifically, we commence by constructing feature graphs that can incorporate pairwise relationship between samples. With the feature graphs to hand, we propose a new information theoretic criterion to measure the joint relevance of different pairwise feature combinations with respect to the target feature graph representation. This measure is used to obtain a structural interaction matrix where the elements represent the proposed information theoretic measure between feature pairs. We then formulate a new optimization model through the combination of the structural interaction matrix and an elastic net regression model for the feature subset selection problem. This allows us to a) preserve the information of the original vectorial space, b) remedy the information loss of the original feature space caused by using graph representation, and c) promote a sparse solution and also encourage correlated features to be selected. Because the proposed optimization problem is non-convex, we develop an efficient alternating direction multiplier method (ADMM) to locate the optimal solutions. Extensive experiments on various datasets demonstrate the effectiveness of the proposed method. Keywords: Feature Selection; Graph; Interacting Elastic Net; Sparse; ADMM 1. Introduction There has recently been a rapid growth in both the size and dimension of the data encountered in many real world applications of pattern recognition including image processing, bioinformatics, and financial analysis. Finding useful information and building effective prediction models from such data presents new challenges for machine learning and pattern recognition [1]. One way to overcome this problem is to develop efficient spectral methods including stochastic neighbour embedding [2], elastic embedding methods [3] and feature selection [4] methods to reduce the dimensionality of the data.
Sep-8-2018
- Country:
- South America > Paraguay
- North America
- United States
- Nevada (0.04)
- Washington > King County
- Bellevue (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Santa Clara County > Stanford (0.04)
- Canada > British Columbia
- United States
- Europe
- Italy (0.04)
- United Kingdom > England
- North Yorkshire > York (0.04)
- Asia
- Middle East > Israel
- Haifa District > Haifa (0.04)
- China
- Fujian Province > Xiamen (0.04)
- Beijing > Beijing (0.04)
- Middle East > Israel
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Health & Medicine (1.00)
- Technology: