SF$^2$Bench: Evaluating Data-Driven Models for Compound Flood Forecasting in South Florida