Have Unbalanced Classes? Try Significant Terms
The words that are significant to a class can be used improve the precision-recall trade off in classification. And it is tougher (sorry Yogi!) when the target classes to predict have widely varying supports. But that does happen often with real world datasets. Case in point is the prediction of a near future CCU readmission of a patient based on a discharge note. Only a small fraction of patients get readmitted to CCU within 30 days of a discharge. Our analysis of MIMIC-III dataset in the previous post showed that over 93% of the patients did not require readmission.
Dec-24-2019, 13:34:51 GMT
- Technology: