Treeformer: Dense Gradient Trees for Efficient Attention Computation

Open in new window