Appendix for Wukong: A100 Million Large-scale Chinese Cross-modal Pretraining Benchmark A Examples in Wukong Dataset

Neural Information Processing Systems 

A diverse range of concepts are included. Figure 2: The word cloud generated with texts in Wukong dataset. For example, " 月 " means month; " 日 " is day; " 做 " is do and " 一个 " means one. Figure 1 shows some examples in our dataset. These image-text pairs involve many types of content, e.g., social news, sporting events, product introduction, et al.