Evaluating and Mitigating Social Bias for Large Language Models in Open-ended Settings