Improving Language Modelling with Noise Contrastive Estimation