Sequence-to-Sequence Learning with Latent Neural Grammars

Neural Information Processing Systems

Sequence-to-sequence learning with neural networks has become the de facto standard for sequence modeling.