Social Bias Benchmark for Generation: A Comparison of Generation and QA-Based Evaluations