DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?