DataPerf: Benchmarks for Data-Centric AI Development Mark Mazumder
–Neural Information Processing Systems
Machine learning research has long focused on models rather than datasets, and prominent datasets are used for common ML tasks without regard to the breadth, difficulty, and faithfulness of the underlying problems. Neglecting the fundamental importance of data has given rise to inaccuracy, bias, and fragility in real-world applications, and research is hindered by saturation across existing dataset benchmarks.
Neural Information Processing Systems
Feb-7-2026, 23:54:59 GMT
- Country:
- Europe
- Netherlands > North Brabant
- Eindhoven (0.04)
- Switzerland > Zürich
- Zürich (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Netherlands > North Brabant
- North America
- Canada (0.04)
- United States > California
- San Diego County > San Diego (0.04)
- Europe
- Genre:
- Research Report > Promising Solution (0.67)
- Industry:
- Information Technology (0.46)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Inductive Learning (0.67)
- Neural Networks > Deep Learning (1.00)
- Statistical Learning (0.67)
- Natural Language (1.00)
- Representation & Reasoning (1.00)
- Vision (0.93)
- Machine Learning
- Data Science > Data Quality (0.94)
- Artificial Intelligence
- Information Technology