Towards Region-aware Bias Evaluation Metrics