AITopics | wga

Collaborating Authors

wga

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

dc4db2ff2c1aefce3b594f821ea82fe6-Paper-Conference.pdf

Neural Information Processing SystemsFeb-18-2026, 09:25:39 GMT

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.94)

Industry: Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

On the Unreasonable Effectiveness of Last-layer Retraining

Hill, John C., LaBonte, Tyler, Zhang, Xinchen, Muthukumar, Vidya

arXiv.org Artificial IntelligenceDec-2-2025

Last-layer retraining (LLR) methods -- wherein the last layer of a neural network is reinitialized and retrained on a held-out set following ERM training -- have garnered interest as an efficient approach to rectify dependence on spurious correlations and improve performance on minority groups. Surprisingly, LLR has been found to improve worst-group accuracy even when the held-out set is an imbalanced subset of the training set. We initially hypothesize that this ``unreasonable effectiveness'' of LLR is explained by its ability to mitigate neural collapse through the held-out set, resulting in the implicit bias of gradient descent benefiting robustness. Our empirical investigation does not support this hypothesis. Instead, we present strong evidence for an alternative hypothesis: that the success of LLR is primarily due to better group balance in the held-out set. We conclude by showing how the recent algorithms CB-LLR and AFR perform implicit group-balancing to elicit a robustness improvement.

artificial intelligence, llr, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2512.01766

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Mitigating Spurious Correlations in Patch-wise Tumor Classification on High-Resolution Multimodal Images

Asaad, Ihab, Shadaydeh, Maha, Denzler, Joachim

arXiv.org Artificial IntelligenceNov-18-2025

Patch-wise multi-label classification provides an efficient alternative to full pixel-wise segmentation on high-resolution images, particularly when the objective is to determine the presence or absence of target objects within a patch rather than their precise spatial extent. This formulation substantially reduces annotation cost, simplifies training, and allows flexible patch sizing aligned with the desired level of decision granularity. In this work, we focus on a special case, patch-wise binary classification, applied to the detection of a single class of interest (tumor) on high-resolution multimodal nonlinear microscopy images. We show that, although this simplified formulation enables efficient model development, it can introduce spurious correlations between patch composition and labels: tumor patches tend to contain larger tissue regions, whereas non-tumor patches often consist mostly of background with small tissue areas. We further quantify the bias in model predictions caused by this spurious correlation, and propose to use a debiasing strategy to mitigate its effect. Specifically, we apply GERNE, a debiasing method that can be adapted to maximize worst-group accuracy (WGA). Our results show an improvement in WGA by approximately 7% compared to ERM for two different thresholds used to binarize the spurious feature. This enhancement boosts model performance on critical minority cases, such as tumor patches with small tissues and non-tumor patches with large tissues, and underscores the importance of spurious correlation-aware learning in patch-wise classification problems.

artificial intelligence, correlation, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2511.13527

Country: Europe (0.15)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area > Oncology (0.96)
Health & Medicine > Diagnostic Medicine (0.94)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

SAGE: Spuriousness-Aware Guided Prompt Exploration for Mitigating Multimodal Bias

Ye, Wenqian, Wang, Di, Zheng, Guangtao, Liu, Bohan, Zhang, Aidong

arXiv.org Artificial IntelligenceNov-18-2025

Large vision-language models, such as CLIP, have shown strong zero-shot classification performance by aligning images and text in a shared embedding space. However, CLIP models often develop multimodal spurious biases, which is the undesirable tendency to rely on spurious features. For example, CLIP may infer object types in images based on frequently co-occurring backgrounds rather than the object's core features. This bias significantly impairs the robustness of pre-trained CLIP models on out-of-distribution data, where such cross-modal associations no longer hold. Existing methods for mitigating multimodal spurious bias typically require fine-tuning on downstream data or prior knowledge of the bias, which undermines the out-of-the-box usability of CLIP. In this paper, we first theoretically analyze the impact of multimodal spurious bias in zero-shot classification. Based on this insight, we propose Spuriousness-Aware Guided Exploration (SAGE), a simple and effective method that mitigates spurious bias through guided prompt selection. SAGE requires no training, fine-tuning, or external annotations. It explores a space of prompt templates and selects the prompts that induce the largest semantic separation between classes, thereby improving worst-group robustness. Extensive experiments on four real-world benchmark datasets and five popular backbone models demonstrate that SAGE consistently improves zero-shot performance and generalization, outperforming previous zero-shot approaches without any external knowledge or model updates.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2511.13005

Country: North America > United States (0.68)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

dc4db2ff2c1aefce3b594f821ea82fe6-Paper-Conference.pdf

Neural Information Processing SystemsOct-11-2025, 00:43:49 GMT

accuracy, dataset, eigenvalue, (15 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.94)

Industry: Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)

Add feedback

Efficient Bias Mitigation Without Privileged Information

Zarlenga, Mateo Espinosa, Sankaranarayanan, Swami, Andrews, Jerone T. A., Shams, Zohreh, Jamnik, Mateja, Xiang, Alice

arXiv.org Artificial IntelligenceSep-26-2024

Deep neural networks trained via empirical risk minimisation often exhibit significant performance disparities across groups, particularly when group and task labels are spuriously correlated (e.g., "grassy background" and "cows"). Existing bias mitigation methods that aim to address this issue often either rely on group labels for training or validation, or require an extensive hyperparameter search. Such data and computational requirements hinder the practical deployment of these methods, especially when datasets are too large to be group-annotated, computational resources are limited, and models are trained through already complex pipelines. In this paper, we propose Targeted Augmentations for Bias Mitigation (TAB), a simple hyperparameter-free framework that leverages the entire training history of a helper model to identify spurious samples, and generate a group-balanced training set from which a robust model can be trained. We show that TAB improves worst-group performance without any group information or model selection, outperforming existing methods while maintaining overall accuracy.

accuracy, dataset, hyperparameter, (13 more...)

arXiv.org Artificial Intelligence

2409.17691

Country:

North America > United States > California (0.04)
North America > Canada > Quebec > Montreal (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine (0.46)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

The Group Robustness is in the Details: Revisiting Finetuning under Spurious Correlations

LaBonte, Tyler, Hill, John C., Zhang, Xinchen, Muthukumar, Vidya, Kumar, Abhishek

arXiv.org Artificial IntelligenceJul-18-2024

Modern machine learning models are prone to over-reliance on spurious correlations, which can often lead to poor performance on minority groups. In this paper, we identify surprising and nuanced behavior of finetuned models on worst-group accuracy via comprehensive experiments on four well-established benchmarks across vision and language tasks. We first show that the commonly used class-balancing techniques of mini-batch upsampling and loss upweighting can induce a decrease in worst-group accuracy (WGA) with training epochs, leading to performance no better than without class-balancing. While in some scenarios, removing data to create a class-balanced subset is more effective, we show this depends on group structure and propose a mixture method which can outperform both techniques. Next, we show that scaling pretrained models is generally beneficial for worst-group accuracy, but only in conjuction with appropriate class-balancing. Finally, we identify spectral imbalance in finetuning features as a potential source of group disparities -- minority group covariance matrices incur a larger spectral norm than majority groups once conditioned on the classes. Our results show more nuanced interactions of modern finetuned models with group robustness than was previously known. Our code is available at https://github.com/tmlabonte/revisiting-finetuning.

accuracy, cit, dataset, (16 more...)

arXiv.org Artificial Intelligence

2407.13957

Country: North America > United States > California (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine (0.46)
Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Are Compressed Language Models Less Subgroup Robust?

Gee, Leonidas, Zugarini, Andrea, Quadrianto, Novi

arXiv.org Artificial IntelligenceMar-26-2024

To reduce the inference cost of large language models, model compression is increasingly used to create smaller scalable models. However, little is known about their robustness to minority subgroups defined by the labels and attributes of a dataset. In this paper, we investigate the effects of 18 different compression methods and settings on the subgroup robustness of BERT language models. We show that worst-group performance does not depend on model size alone, but also on the compression method used. Additionally, we find that model compression does not always worsen the performance on minority subgroups. Altogether, our analysis serves to further research into the subgroup robustness of model compression.

compression, multinli, subgroup, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2023.emnlp-main.983

2403.17811

Country:

North America > United States (0.18)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
Europe > Spain (0.04)
(2 more...)

Genre: Research Report (0.50)

Industry:

Law > Government & the Courts (0.32)
Government > Regional Government (0.32)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.34)

Add feedback

Robustness to Subpopulation Shift with Domain Label Noise via Regularized Annotation of Domains

Stromberg, Nathan, Ayyagari, Rohan, Welfert, Monica, Koyejo, Sanmi, Sankar, Lalitha

arXiv.org Machine LearningFeb-16-2024

Existing methods for last layer retraining that aim to optimize worst-group accuracy (WGA) rely heavily on well-annotated groups in the training data. We show, both in theory and practice, that annotation-based data augmentations using either downsampling or upweighting for WGA are susceptible to domain annotation noise, and in high-noise regimes approach the WGA of a model trained with vanilla empirical risk minimization. We introduce Regularized Annotation of Domains (RAD) in order to train robust last layer classifiers without the need for explicit domain annotations. Our results show that RAD is competitive with other recently proposed domain annotation-free techniques. Most importantly, RAD outperforms state-of-the-art annotation-reliant methods even with only 5% noise in the training data for several publicly available datasets.

erm, noise, wga, (15 more...)

arXiv.org Machine Learning

2402.11039

Country: North America > United States > Arizona (0.04)

Genre: Research Report > New Finding (0.86)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

The Hollywood Strikes Stopped AI From Taking Your Job. But for How Long?

WIREDDec-25-2023, 12:00:00 GMT

Revolt against the machines began at Swingers. And at Bob's Big Boy, where for weeks Drew Carey picked up the tab. Members of the Writers Guild of America, or WGA, met at both Los Angeles-area diners frequently during their 148-day strike, which hinged on protecting Hollywood's scribes from being overrun by the march of artificial intelligence. Members of the WGA were just a small part of the resistance. The Screen Actors Guild--American Federation of Television and Radio Artists, or SAG-AFTRA, soon joined them on the picket lines, together forming a formidable uprising against the perceived threat of AI.

hollywood strike stopped ai, union, writer guild, (3 more...)

WIRED

Country: North America > United States > California > Los Angeles County > Los Angeles (0.26)

Industry: Media (0.74)

Technology: Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.74)

Add feedback