Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era
Li, Dawei, Huang, Yue, Li, Ming, Zhou, Tianyi, Zhang, Xiangliang, Liu, Huan
–arXiv.org Artificial Intelligence
Generative models such as Large Language Models, Diffusion Models, and generative adversarial networks have recently revolutionized the creation of synthetic data, offering scalable solutions to data scarcity, privacy, and annotation challenges in data mining. This tutorial introduces the foundations and latest advances in synthetic data generation, covers key methodologies and practical frameworks, and discusses evaluation strategies and applications. Attendees will gain actionable insights into leveraging generative synthetic data to enhance data mining research and practice. More information can be found on our website: https://syndata4dm.github.io/.
arXiv.org Artificial Intelligence
Aug-28-2025
- Country:
- Asia
- China
- Beijing > Beijing (0.04)
- Jiangsu Province > Yancheng (0.04)
- Shaanxi Province > Xi'an (0.04)
- Middle East > Saudi Arabia (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.42)
- China
- Europe > France (0.04)
- North America > United States
- Arizona > Maricopa County
- Tempe (0.05)
- California > San Diego County
- San Diego (0.04)
- Indiana > Saint Joseph County
- South Bend (0.05)
- Maryland > Prince George's County
- College Park (0.15)
- New York > New York County
- New York City (0.04)
- Texas (0.04)
- Arizona > Maricopa County
- Asia
- Genre:
- Industry:
- Education (1.00)
- Government > Regional Government (0.68)
- Health & Medicine (1.00)
- Information Technology > Security & Privacy (0.69)
- Technology: