When Parameter-efficient Tuning Meets General-purpose Vision-language Models