Beyond Pass@k: Breadth-Depth Metrics for Reasoning Boundaries

Open in new window