Ethnicity sensitive author disambiguation using semi-supervised learning