TaiSu: A 166M Large-scale High-Quality Dataset for Chinese Vision-Language Pre-training