Measuring Reasoning Utility in LLMs via Conditional Entropy Reduction