SEER: The Span-based Emotion Evidence Retrieval Benchmark