Modeling Text with Decision Forests using Categorical-Set Splits