What Are the Data-Centric AI Concepts behind GPT Models?