Transformer in Transformer Kai Han 1,2 An Xiao 2 Enhua Wu

Neural Information Processing Systems 

Features of both words and sentences will be aggregated to enhance the representation ability.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found