Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models

Open in new window