Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs