NovelHopQA: Diagnosing Multi-Hop Reasoning Failures in Long Narrative Contexts
