Learning Sequence Representations by Non-local Recurrent Neural Memory