Hypothesis Testing in Machine Learning: What for and Why
Suppose you are working on a machine learning project in which you want to predict whether patients have a life-threatening disease, based on several features in your dataset such as blood pressure, heart rate, pulse, and others. Sounds like a serious project, one where you'll need to really trust your model and its predictions, right? That's why you gathered hundreds of samples, which your local hospital very kindly allowed you to collect, given the importance and seriousness of the topic. But how do you know whether your sample is representative of the whole population? And how can you know how much of a difference between the two might be reasonable?
Nov-2-2019, 20:37:47 GMT