Language Model Pre-Training with Sparse Latent Typing