About Feature Scaling and Normalization


The result of standardization (or Z-score normalization) is that the features will be rescaled so that they have the properties of a standard normal distribution with μ = 0 and σ = 1, where μ is the mean and σ is the standard deviation; the standard score (or z-score) of a sample x is computed as z = (x − μ) / σ. Standardizing the features so that they are centered around 0 with a standard deviation of 1 is not only important if we are comparing measurements that have different units, but it is also a general requirement for many machine learning algorithms.

Intuitively, we can think of gradient descent as a prominent example (an optimization algorithm often used in logistic regression, SVMs, perceptrons, neural networks, etc.): with features on different scales, certain weights may update faster than others, since the feature values play a role in the weight updates. Other intuitive examples include K-nearest neighbor algorithms and clustering algorithms that use, for example, Euclidean distance measures.

In fact, the only family of algorithms I can think of that is scale-invariant is tree-based methods. Take the general CART decision tree algorithm: since each split compares a single feature against a threshold, it really doesn't matter on which scale that feature is (centimeters, Fahrenheit, a standardized scale – it really doesn't matter).
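As a minimal sketch, z-score standardization can be computed per feature with NumPy; the toy matrix X below is made up purely for illustration (two features on very different scales):

```python
# Minimal sketch of z-score standardization with NumPy; the toy matrix X
# is hypothetical (two features on very different scales).
import numpy as np

X = np.array([[1.0, 200.0],
              [2.0, 300.0],
              [3.0, 400.0]])

mu = X.mean(axis=0)       # per-feature mean
sigma = X.std(axis=0)     # per-feature standard deviation
X_std = (X - mu) / sigma  # z = (x - mu) / sigma, applied column-wise

print(X_std.mean(axis=0))  # ~[0. 0.]  (centered around 0)
print(X_std.std(axis=0))   # [1. 1.]   (unit standard deviation)
```

In practice, scikit-learn's StandardScaler does the same thing and, importantly, lets you reuse the training set's μ and σ when transforming the test set.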
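To make the scale-sensitivity argument concrete, here is a small sketch (the points q, A, B and the kilograms-to-grams conversion are hypothetical) showing that a mere unit change flips the Euclidean nearest neighbor, while a single CART-style threshold split produces an identical partition either way:

```python
# Hypothetical toy example: a unit change (kilograms -> grams) flips the
# Euclidean nearest neighbor, while a tree-style threshold split is unaffected.
import numpy as np

q = np.array([0.0, 0.0])   # query point
A = np.array([3.0, 0.0])   # 3 units of feature 1, 0 kg of feature 2
B = np.array([0.0, 2.0])   # 0 units of feature 1, 2 kg of feature 2

def dist(p):
    return float(np.linalg.norm(p - q))

print(dist(A), dist(B))    # 3.0 vs 2.0 -> B is the nearer neighbor

# Express feature 2 in grams instead: same data, different unit.
scale = np.array([1.0, 1000.0])
print(dist(A * scale), dist(B * scale))  # 3.0 vs 2000.0 -> now A is nearer

# A single decision-tree split "x <= t" is simply re-expressed as
# "x*1000 <= t*1000" under the unit change; the partition is identical.
x = np.array([1.0, 2.0, 3.0, 4.0])                      # feature values in kg
print(np.array_equal(x <= 2.5, x * 1000.0 <= 2500.0))   # True
```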