r/MachineLearning - [D] Decision Tree Splitting strategy

Dec-24-2019, 13:39:34 GMT–#artificialintelligence

I have a dataset with 4 categorical features (Cholesterol, Systolic Blood pressure, diastolic blood pressure, and smoking rate). I use a decision tree classifier to find the probability of stroke. I am trying to verify my understanding of the splitting procedure done by Python Sklearn. Since it is a binary tree, there are three possible ways to split the first feature which is either to group categories {0 and 1 to a leaf, 2 to another leaf} or {0 and 2, 1}, or {0, 1 and 2}. What I know (please correct me here) is that the chosen split is the one with the highest information gain.

decision tree splitting strategy, information gain, machinelearning, (1 more...)

#artificialintelligence

Dec-24-2019, 13:39:34 GMT

News Web Page

Add feedback

Industry:
- Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)

Technology:
- Information Technology
  - Communications > Social Media (0.76)
  - Artificial Intelligence
    - Representation & Reasoning > Diagnosis (0.71)
    - Machine Learning > Decision Tree Learning (0.71)