Weighted Sampling for Masked Language Modeling