Stochastic Learning of Nonstationary Kernels for Natural Language Modeling