Ethical Considerations for Responsible Data Curation
–Neural Information Processing Systems
HCCV datasets constructed through nonconsensual web scraping lack crucial metadata for comprehensive fairness and robustness evaluations. Current remedies are post hoc, lack persuasive justification for adoption, or fail to provide proper contextualization for appropriate application. Our research focuses on proactive, domain-specific recommendations, covering purpose, privacy and consent, and diversity, for curating HCCV evaluation datasets, addressing privacy and bias concerns. We adopt an ante hoc reflective perspective, drawing from current practices, guidelines, dataset withdrawals, and audits, to inform our considerations and recommendations.
Neural Information Processing Systems
Apr-29-2026, 09:53:02 GMT
- Country:
- North America > United States (1.00)
- Europe (1.00)
- Asia (1.00)
- Genre:
- Research Report (0.93)
- Overview (0.67)
- Personal > Interview (0.48)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Media (0.93)
- Education (0.67)
- Law
- Civil Rights & Constitutional Law (1.00)
- Statutes (0.67)
- Health & Medicine
- Pharmaceuticals & Biotechnology (0.92)
- Consumer Health (0.68)
- Therapeutic Area > Psychiatry/Psychology (0.46)
- Government > Regional Government
- Technology:
- Information Technology
- Sensing and Signal Processing > Image Processing (1.00)
- Security & Privacy (1.00)
- Data Science > Data Mining (1.00)
- Communications > Social Media (1.00)
- Artificial Intelligence
- Robots (1.00)
- Representation & Reasoning (1.00)
- Natural Language (1.00)
- Issues > Social & Ethical Issues (1.00)
- Vision
- Image Understanding (1.00)
- Face Recognition (1.00)
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Information Technology