R-squared for Decision Tree


I use the methodology you describe all the time. I was the original programmer for Breiman and Stone's version of CART in the late 1970s, which is where I believe I was first introduced to the method. However, we were very careful to use the term "variation explained", since there is little relationship to the theoretical Pearson "r". Be aware that this value can go negative, which implies that parts of your model have much higher variation than the population variance.
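
To illustrate how the value can go negative, here is a minimal sketch. It assumes the usual definition, 1 - SSE/SST, where SSE is the squared error around the model's predictions and SST is the squared error around the overall mean. The function name and the toy numbers are made up for illustration; this is not the CART code.

```python
def variation_explained(y_true, y_pred):
    """Return 1 - SSE/SST (assumed definition of "variation explained")."""
    mean_y = sum(y_true) / len(y_true)
    # Squared error of the model's predictions
    sse = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    # Squared error of always predicting the overall mean
    sst = sum((y - mean_y) ** 2 for y in y_true)
    return 1.0 - sse / sst


y = [1.0, 2.0, 3.0, 4.0]  # toy data, purely illustrative

print(variation_explained(y, y))                     # perfect predictions -> 1.0
print(variation_explained(y, [2.5, 2.5, 2.5, 2.5]))  # predicting the mean -> 0.0
print(variation_explained(y, [4.0, 3.0, 2.0, 1.0]))  # worse than the mean -> -3.0
```

In the last case the predictions scatter further from the observed values than the overall mean does, so SSE exceeds SST and the result drops below zero. That cannot happen with a squared correlation, which is one reason to avoid calling it "r".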

