SAGE: A Realistic Benchmark for Semantic Understanding