No Language Data Left Behind: A Comparative Study of CJK Language Datasets in the Hugging Face Ecosystem