Exploratory Data Analysis: Baby Steps

#artificialintelligence 

It is also used to identify the outliers in the dataset. Here we can see that the mean is around 50000. There are also few outliers at 60000 and 1000000, which should be treated in the preprocessing stage. A count plot can be thought of as a histogram across a categorical, instead of numeric, variable. It is used to find the frequency of each category. Here we can see that category "86" is dominating over the other categories. These are the basic, initial steps in exploratory data analysis. I wish to cover the rest of the steps in the next few articles. I hope you found this short article helpful.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found