ParaFormer: Shallow Parallel Transformers with Progressive Approximation