CoTFormer: More Tokens With Attention Make Up For Less Depth

Open in new window