Stop One-Hot Encoding Your Categorical Variables.

Sep-3-2020, 13:45:47 GMT–#artificialintelligence

One-hot encoding, otherwise known as dummy variables, is a method of converting categorical variables into several binary columns, where a 1 indicates the presence of that row belonging to that category. It is, pretty obviously, not a great a choice for the encoding of categorical variables from a machine learning perspective. Most apparent is the heavy amount of dimensionality it adds, and it is common knowledge that generally a lower amount of dimensions is better. For example, if we were to have a column representing a US state (e.g. California, New York), a one-hot encoding scheme would result in fifty additional dimensions.

artificial intelligence, dimension, machine learning, (12 more...)

#artificialintelligence

Sep-3-2020, 13:45:47 GMT

News Web Page

Add feedback

Country:
- North America > United States
  - New York (0.25)
  - California (0.25)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.92)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found