Towards Generalizable Context-aware Anomaly Detection: A Large-scale Benchmark in Cloud Environments