Can a bidirectional encoder become the ultimate winner for downstream applications of foundation models?