PHANTOM: ABenchmark for Hallucination Detection in Financial Long-Context QA

Open in new window