ParaFormer: Shallow Parallel Transformers with Progressive Approximation

Open in new window