Feature selection for classification with class-separability strategy and data envelopment analysis
Zhang, Yishi, Yang, Chao, Yang, Anrong, Xiong, Chan, Zhou, Xingchi, Zhang, Zigang
In this paper, a novel feature selection method is presented, which is based on Class-Separability (CS) strategy and Data Envelopment Analysis (DEA). To better capture the relationship between features and the class, class labels are separated into individual variables and relevance and redundancy are explicitly handled on each class label. Super-efficiency DEA is employed to evaluate and rank features via their conditional dependence scores on all class labels, and the feature with maximum super-efficiency score is then added in the conditioning set for conditional dependence estimation in the next iteration, in such a way as to iteratively select features and get the final selected features. Eventually, experiments are conducted to evaluate the effectiveness of proposed method comparing with four state-of-the-art methods from the viewpoint of classification accuracy. Empirical results verify the feasibility and the superiority of proposed feature selection method. Keywords: Feature selection, classification, class-separability strategy, data envelopment analysis, super-efficiency 1. Introduction The explosion of large datasets in many fields poses unprecedented challenges to pattern recognition and data mining. Not only is the scale of samples getting larger, but also new types of data become prevalent. For example, tremendous new computer and Internet applications generate large amounts of types of data at an exponential rate in the world. It is thus realized that feature selection is an indispensable component [1]. Feature selection is a process of selecting a subset of original features according to certain criteria. It is an important and frequently used technique for dimension reduction.
Feb-1-2015
- Country:
- North America > United States
- California (0.68)
- Europe > United Kingdom
- England (0.46)
- North America > United States
- Genre:
- Research Report > Promising Solution (0.34)
- Industry:
- Health & Medicine (0.68)
- Technology: