HD-Eval: Aligning Large Language Model Evaluators Through Hierarchical Criteria Decomposition