Exploratory Data Analysis (EDA) -- Don't ask how, ask what
Exploratory Data Analysis (EDA) is the process of understanding what data we have in our dataset before we start looking for solutions to our problem. In other words, it is the act of analyzing the data without biased assumptions so that we can preprocess the dataset effectively for modeling. The main reasons we do EDA are to verify the data in the dataset, to check whether the data makes sense in the context of the problem, and sometimes simply to learn about the problem we are exploring. Pandas Profiling is probably the easiest way to do EDA quickly (although there are many alternatives, such as SweetViz). The downside of Pandas Profiling is that it can be slow, since it produces a very in-depth analysis even when you do not need one.
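When a full profiling report is more than you need, a few plain pandas calls already answer the "what do we have?" question. The sketch below is only an illustration of that idea, not the article's own method: the helper name `quick_overview` and the toy columns (`age`, `city`, `income`) are invented for this example.

```python
import pandas as pd


def quick_overview(df: pd.DataFrame) -> dict:
    """Collect a few basic facts about a DataFrame before modeling.

    Hypothetical helper for illustration; it is not part of pandas.
    """
    return {
        "shape": df.shape,                             # (rows, columns)
        "dtypes": df.dtypes.astype(str).to_dict(),     # column -> dtype name
        "missing": df.isna().sum().to_dict(),          # column -> count of missing values
        "duplicate_rows": int(df.duplicated().sum()),  # rows repeating an earlier row
        "numeric_summary": df.describe(),              # count, mean, std, quartiles
    }


# Toy data, made up for this example only.
df = pd.DataFrame(
    {
        "age": [34, 51, None, 29, 29],
        "city": ["Oslo", None, "Lima", "Pune", "Pune"],
        "income": [52000.0, 61000.0, 58000.0, 47000.0, 47000.0],
    }
)

overview = quick_overview(df)
for name, value in overview.items():
    print(f"--- {name} ---")
    print(value)
```

This covers the verification step (shapes, types, missing values, duplicates) in well under a second on small data. If you later want the in-depth report, Pandas Profiling builds it from the same DataFrame through its `ProfileReport` class, and passing `minimal=True` should skip the more expensive computations; check the documentation of the version you have installed.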
Jun-19-2022, 03:50:33 GMT