A Tensorized Transformer for Language Modeling

Xindian Ma, Peng Zhang, Shuai Zhang, Nan Duan, Yuexian Hou, Ming Zhou, Dawei Song

Neural Information Processing Systems 

Latest development of neural models has connected the encoder and decoder through a self-attention mechanism.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found