Informative missingness and its implications in semi-supervised learning