Goto

Collaborating Authors

 machine learning visualisation


A Simple Guide to Machine Learning Visualisations - KDnuggets

#artificialintelligence

An important step in developing machine learning models is to evaluate the performance. Depending on the type of machine learning problem that you are dealing with, there is generally a choice of metrics to choose from to perform this step. However, simply looking at one or two numbers in isolation cannot always enable us to make the right choice for model selection. For example, a single error metric doesn't give us any information about the distribution of the errors. It does not answer questions like is the model wrong in a big way a small number of times, or is it producing lots of smaller errors?


A Simple Guide to Machine Learning Visualisations

#artificialintelligence

The Yellowbrick library also contains a set of visualisation tools for analysing clustering algorithms. A common way to evaluate the performance of clustering models is with an intercluster distance map. The intercluster distance map plots an embedding of each cluster centre and visualises both the distance between the clusters and the relative size of each cluster based on membership. We can turn the diabetes dataset into a clustering problem by only using the features (X). Before we cluster the data we can use the popular elbow method to find the optimal number of clusters.