DataPerf: Benchmarks for Data-Centric AI Development Mark Mazumder
–Neural Information Processing Systems
Machine learning research has long focused on models rather than datasets, and prominent datasets are used for common ML tasks without regard to the breadth, difficulty, and faithfulness of the underlying problems. Neglecting the fundamental importance of data has given rise to inaccuracy, bias, and fragility in real-world applications, and research is hindered by saturation across existing dataset benchmarks.
Neural Information Processing Systems
Feb-7-2026, 23:54:59 GMT
- Country:
- North America
- Canada (0.04)
- United States > California
- San Diego County > San Diego (0.04)
- Europe
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Switzerland > Zürich
- Zürich (0.04)
- Netherlands > North Brabant
- Eindhoven (0.04)
- United Kingdom > England
- North America
- Genre:
- Research Report > Promising Solution (0.67)
- Industry:
- Information Technology (0.46)
- Technology:
- Information Technology
- Data Science > Data Quality (0.94)
- Artificial Intelligence
- Representation & Reasoning (1.00)
- Natural Language (1.00)
- Vision (0.93)
- Machine Learning
- Neural Networks > Deep Learning (1.00)
- Statistical Learning (0.67)
- Inductive Learning (0.67)
- Information Technology