TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data
Tiwald, Paul, Krchova, Ivona, Sidorenko, Andrey, Vieyra, Mariana Vargas, Scriminaci, Mario, Platzer, Michael
–arXiv.org Artificial Intelligence
Synthetic data generation for tabular datasets must balance fidelity, efficiency, and versatility to meet the demands of real-world applications. We introduce the Tabular Auto-Regressive Generative Network (TabularARGN), a flexible framework designed to handle mixed-type, multivariate, and sequential datasets. By training on all possible conditional probabilities, TabularARGN supports advanced features such as fairness-aware generation, imputation, and conditional generation on any subset of columns. The framework achieves state-of-the-art synthetic data quality while significantly reducing training and inference times, making it ideal for large-scale datasets with diverse structures. Evaluated across established benchmarks, including realistic datasets with complex relationships, TabularARGN demonstrates its capability to synthesize high-quality data efficiently. By unifying flexibility and performance, this framework paves the way for practical synthetic data generation across industries.
arXiv.org Artificial Intelligence
Feb-6-2025
- Country:
- North America > United States
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- California > Orange County
- Irvine (0.04)
- New York > New York County
- Europe
- Austria > Vienna (0.14)
- Switzerland (0.04)
- Monaco (0.04)
- North America > United States
- Genre:
- Research Report (1.00)
- Industry:
- Information Technology (0.67)
- Technology: