AITopics | high dimensionality and mixed datatype

Collaborating Authors

high dimensionality and mixed datatype

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

How to cluster dataset with high dimensionality and mixed datatypes

#artificialintelligenceNov-25-2019, 09:34:37 GMT

When it comes to cluster analysis for retail and e-commerce customer data, more often than not, you will find the dataset messy, high dimensional and with many categorical variables. Although there are many dimensional reduction techniques, most of them do not work well with the dataset with many categorical variables. Traditionally, clustering approaches suffer when features are not clean numeric values. For example, the most popular algorithm KNN can only handle numeric variables. Generalized low rank models (GLRMs), developed by students at Stanford University (see Udell '16) -- propose a new clustering framework to handle all types of data even with mixed datatypes.

categorical variable, dataset, high dimensionality and mixed datatype, (7 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.38)

Add feedback