AITopics | dataset bias

Collaborating Authors

dataset bias

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

DPA: AOne-stop Metric to Measure Bias Amplification in Classification Datasets

Neural Information Processing SystemsJun-22-2026, 23:07:38 GMT

Most ML datasets today contain biases. When we train models on these datasets, they often not only learn these biases but can worsen them -- a phenomenon known as bias amplification. Several co-occurrence-based metrics have been proposed to measure bias amplification in classification datasets. They measure bias amplification between a protected attribute (e.g., gender) and a task (e.g., cooking). These metrics also support fine-grained bias analysis by identifying the direction in which a model amplifies biases. However, co-occurrence-based metrics have limitations -- some fail to measure bias amplification in balanced datasets, while others fail to measure negative bias amplification.

bias amplification, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Genre: Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Add feedback

ConceptScope: Characterizing Dataset Bias via Disentangled Visual Concepts

Neural Information Processing SystemsJun-10-2026, 07:25:35 GMT

Dataset bias, where data points are skewed to certain concepts, is ubiquitous in machine learning datasets. Yet, systematically identifying these biases is challenging without costly, fine-grained attribute annotations. We present ConceptScope, a scalable and automated framework for analyzing visual datasets by discovering and quantifying human-interpretable concepts using Sparse Autoencoders trained on representations from vision foundation models. ConceptScope categorizes concepts into target, context, and bias types based on their semantic relevance and statistical correlation to class labels, enabling class-level dataset characterization, bias identification, and robustness evaluation through concept-based subgrouping.

artificial intelligence, machine learning, proceedings, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.78)

Add feedback

7172e147d916eef4cb1eb30016ce725f-Paper-Conference.pdf

Neural Information Processing SystemsFeb-15-2026, 19:20:57 GMT

accuracy, dataset, information, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania (0.04)
Europe > Poland (0.04)
Asia > Middle East > Jordan (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.68)

Industry: Information Technology (0.67)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(3 more...)

Add feedback

eddc3427c5d77843c2253f1e799fe933-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-11-2026, 00:25:51 GMT

correlation, knowledge, lff, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

7a27143ea615262a0c122eb179c9b7a6-Paper-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 00:35:42 GMT

bert, computational linguistic, subnetwork, (15 more...)

Neural Information Processing Systems

Country: Asia > China (0.04)

Genre: Research Report > New Finding (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Communications (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

UncertaintyCalibrationforEnsemble-Based DebiasingMethods

Neural Information Processing SystemsFeb-9-2026, 08:17:29 GMT

A growing body of literature recognizes debiasing as an important direction in machine learning and natural language processing [38; 3; 4; 34].

bias-only model, machine learning, natural language, (16 more...)

Neural Information Processing Systems

Country: Asia > China (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

LearningDebiasedandDisentangledRepresentations forSemanticSegmentation

Neural Information Processing SystemsFeb-8-2026, 10:56:36 GMT

Despite such phenomenal achievement, semantic segmentation approaches still suffer from the chronic limitations caused byclass imbalance andstereotyped scene contextindatasets.

artificial intelligence, machine learning, representation, (18 more...)

Neural Information Processing Systems

Country: Asia > South Korea > Seoul > Seoul (0.04)

Industry: Transportation > Ground > Road (0.32)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Add feedback

A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models

Neural Information Processing SystemsDec-24-2025, 13:05:32 GMT

Despite the remarkable success of pre-trained language models (PLMs), they still face two challenges: First, large-scale PLMs are inefficient in terms of memory footprint and computation. Second, on the downstream tasks, PLMs tend to rely on the dataset bias and struggle to generalize to out-of-distribution (OOD) data. In response to the efficiency problem, recent studies show that dense PLMs can be replaced with sparse subnetworks without hurting the performance. Such subnetworks can be found in three scenarios: 1) the fine-tuned PLMs, 2) the raw PLMs and then fine-tuned in isolation, and even inside 3) PLMs without any parameter fine-tuning. However, these results are only obtained in the in-distribution (ID) setting.

robust pre-trained language model, subnetwork, win-win deal, (9 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.55)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.78)

Add feedback

Robot Learning in Homes: Improving Generalization and Reducing Dataset Bias

Neural Information Processing SystemsNov-20-2025, 23:17:49 GMT

Data-driven approaches to solving robotic tasks have gained a lot of traction in recent years. However, most existing policies are trained on large-scale datasets collected in curated lab settings. If we aim to deploy these models in unstructured visual environments like people's homes, they will be unable to cope with the mismatch in data distribution. In such light, we present the first systematic effort in collecting a large dataset for robotic grasping in homes. First, to scale and parallelize data collection, we built a low cost mobile manipulator assembled for under 3K USD.

dataset bias, name change, robot learning, (2 more...)

Neural Information Processing Systems

Technology: