Language Modeling With Factorization Memory