Treeformer: Dense Gradient Trees for Efficient Attention Computation