Understanding a Version of Multivariate Symmetric Uncertainty to assist in Feature Selection

Sosa-Cabrera, Gustavo, García-Torres, Miguel, Gómez, Santiago, Schaerer, Christian, Divina, Federico

Sep-25-2017–arXiv.org Machine Learning

In these spaces of high dimensionality, feature selection is a way to exclude those irrelevant and redundant features, whose presence might complicate the task of knowledge discovery. In classification tasks, a feature is considered irrelevant if it contains no information about the class and therefore it is not necessary at all for the predictive task. Besides, it is widely accepted that two features are redundant if their values are correlated. There are several well known measures that compare features and determine their importance, such as the symmetrical uncertainty (SU)[2]. SU is a measure based on information that uses entropy and conditional entropy values to determine the correlation between pairs of features.

artificial intelligence, cardinality, machine learning, (14 more...)

arXiv.org Machine Learning

Sep-25-2017

arXiv.org PDF

Add feedback

Country:
- South America (0.14)
- Europe > Spain (0.14)

Genre:
- Research Report (0.50)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found