DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
