ConsistencyChecker: Tree-based Evaluation of LLM Generalization Capabilities