
Counterfactual Examples


COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs

Neural Information Processing Systems

Counterfactual examples have proven to be valuable in the field of natural language processing (NLP) for both evaluating and improving the robustness of language models to spurious correlations in datasets. Despite their demonstrated utility for NLP, multimodal counterfactual examples have been relatively unexplored due to the difficulty of creating paired image-text data with minimal counterfactual changes. To address this challenge, we introduce a scalable framework for automatic generation of counterfactual examples using text-to-image diffusion models. We use our framework to create COCO-Counterfactuals, a multimodal counterfactual dataset of paired image and text captions based on the MS-COCO dataset. We validate the quality of COCO-Counterfactuals through human evaluations and show that existing multimodal models are challenged by our counterfactual image-text pairs. Additionally, we demonstrate the usefulness of COCO-Counterfactuals for improving out-of-domain generalization of multimodal vision-language models via training data augmentation. We make our code and the COCO-Counterfactuals dataset publicly available.
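
As a concrete illustration of the prompt-pair idea, here is a minimal sketch, not the authors' full framework (which controls the diffusion process more tightly than a simple shared seed). The model name and prompts are illustrative; it assumes the `diffusers` and `torch` packages and a CUDA device.

```python
# Minimal sketch: render a caption and its counterfactual edit from the same
# initial noise, so the two images share layout and differ mainly in the
# edited concept. Requires the `diffusers` and `torch` packages and a GPU.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

caption = "a black cat sitting on a wooden bench"
counterfactual = "a black dog sitting on a wooden bench"  # minimal edit: cat -> dog

for name, prompt in [("original", caption), ("counterfactual", counterfactual)]:
    # Re-seeding identically makes the initial latent noise the same for both
    # prompts, which encourages a shared background and composition.
    generator = torch.Generator("cuda").manual_seed(42)
    image = pipe(prompt, generator=generator, num_inference_steps=30).images[0]
    image.save(f"{name}.png")
```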



LLM-Guided Synthetic Augmentation (LGSA) for Mitigating Bias in AI Systems

Karri, Sai Suhruth Reddy, Nallapuneni, Yashwanth Sai, Mallireddy, Laxmi Narasimha Reddy, G, Gopichand

arXiv.org Artificial Intelligence

Bias in artificial intelligence systems, especially those that rely on natural language data, raises serious ethical and practical issues: when certain groups are underrepresented, performance is often uneven across demographics. While traditional fairness methods such as pre-processing, in-processing, and post-processing can help, they usually depend on protected-attribute labels, create a trade-off between accuracy and fairness, and struggle to adapt across datasets. To tackle these challenges, this study presents LLM-Guided Synthetic Augmentation (LGSA), a process that leverages large language models to create counterfactual examples for underrepresented groups while keeping label integrity intact. We test LGSA on a controlled dataset of short English sentences containing gendered pronouns, professions, and binary task labels. Structured prompts to a large language model generate gender-swapped paraphrases, followed by a thorough quality-control process: semantic-similarity checks, attribute verification, toxicity screening, and human spot checks. The augmented dataset broadened training coverage and was used to train a classifier under consistent experimental conditions. The results show that LGSA significantly lessens performance disparities without compromising accuracy: the baseline model achieved 96.7% accuracy but had a gender bias gap of 7.2%; a simple swap augmentation brought the gap down to 0.7% but also reduced accuracy to 95.6%; LGSA, in contrast, achieved an overall accuracy of 99.1%, with strong performance on female-labeled examples and a reduced gap of 1.9%. By generating diverse and semantically accurate counterfactuals, LGSA balances subgroup performance, narrows bias gaps, and maintains high overall task accuracy and label fidelity, making it a practical framework for fairness-focused AI systems.
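
A minimal sketch of the generate-then-filter loop described above: `call_llm` is a hypothetical stand-in for any chat-completion client, and only the semantic-similarity check of the quality-control stage is shown (attribute verification, toxicity screening, and human spot checks are omitted). It assumes the `sentence-transformers` package.

```python
# Sketch of an LGSA-style generate-then-filter loop. `call_llm` is a
# hypothetical placeholder for an LLM client; the full pipeline also includes
# attribute verification, toxicity screening, and human spot checks.
from sentence_transformers import SentenceTransformer, util

encoder = SentenceTransformer("all-MiniLM-L6-v2")

PROMPT = (
    "Rewrite the sentence, swapping gendered words (he<->she, his<->her, "
    "man<->woman) while changing nothing else:\n{text}"
)

def call_llm(prompt: str) -> str:
    raise NotImplementedError("plug in your LLM client here")

def augment(text: str, label: int, min_sim: float = 0.85):
    """Generate a gender-swapped counterfactual; keep it only if it stays
    semantically close to the source, so the task label carries over."""
    candidate = call_llm(PROMPT.format(text=text)).strip()
    sim = util.cos_sim(encoder.encode(text), encoder.encode(candidate)).item()
    if sim >= min_sim and candidate.lower() != text.lower():
        return candidate, label  # label integrity preserved by construction
    return None  # failed quality control; discard
```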


CaRT: Teaching LLM Agents to Know When They Know Enough

Liu, Grace, Qu, Yuxiao, Schneider, Jeff, Singh, Aarti, Kumar, Aviral

arXiv.org Artificial Intelligence

Many tasks require learned models to strategically gather relevant information over multiple rounds of interaction before actually acting on a task. Strategic information gathering requires models to know not only how to effectively acquire information, but also when to stop gathering information and make a decision, in order to avoid overthinking or getting derailed when acting. In this paper, we formalize this problem and introduce Counterfactuals and Reasoning for Termination (CaRT), an approach for teaching LLMs when to stop seeking information. To appropriately learn when to terminate, CaRT fine-tunes LLMs using counterfactual pairs of trajectories, one where termination is appropriate and a minimally modified version of the same trajectory where it is not. It trains the LLM to explain the rationale for the termination decision in either case via verbal reasoning, and imbues this capability into the base LLM via fine-tuning. We instantiate CaRT in two domains: interactive medical diagnosis and math problem solving. In both domains, we find that CaRT improves the efficiency of information gathering and task success rate compared to other fine-tuning methods.
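
A sketch of how such a counterfactual trajectory pair might be assembled into supervised fine-tuning data; the chat format, rationales, and file name are illustrative placeholders, not the paper's exact setup.

```python
# Sketch: a counterfactual pair of trajectories for fine-tuning termination
# decisions, written out as JSONL chat examples.
import json

def make_example(history, should_stop, rationale):
    """One SFT example: given the interaction so far, the target output is a
    verbalized rationale followed by a stop/continue decision."""
    decision = "STOP" if should_stop else "CONTINUE"
    return {
        "messages": [
            {"role": "user", "content": "\n".join(history)},
            {"role": "assistant", "content": f"{rationale} Decision: {decision}"},
        ]
    }

history = [
    "Q: Any fever? A: Yes, 39C for two days.",
    "Q: Any cough? A: Dry cough, started yesterday.",
    "Q: Any rash? A: No rash.",
]
# Counterfactual pair: the same trajectory, minimally modified so that in one
# version enough information has been gathered and in the other it has not.
pair = [
    make_example(history, True,
                 "Fever, cough, and a negative rash check suffice to act."),
    make_example(history[:1], False,
                 "Only one symptom is known; more questions are needed."),
]
with open("cart_pairs.jsonl", "w") as f:
    for ex in pair:
        f.write(json.dumps(ex) + "\n")
```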



Truth or Twist? Optimal Model Selection for Reliable Label Flipping Evaluation in LLM-based Counterfactuals

Wang, Qianli, Nguyen, Van Bach, Feldhus, Nils, Villa-Arenas, Luis Felipe, Seifert, Christin, Möller, Sebastian, Schmitt, Vera

arXiv.org Artificial Intelligence

Counterfactual examples are widely employed to enhance the performance and robustness of large language models (LLMs) through counterfactual data augmentation (CDA). However, the choice of judge model used to evaluate label flipping, the primary metric for assessing the validity of generated counterfactuals for CDA, yields inconsistent results. To decipher this, we define four types of relationships between the counterfactual generator and judge models: being the same model, belonging to the same model family, being independent models, and having a distillation relationship. Through extensive experiments involving two state-of-the-art LLM-based methods, three datasets, four generator models, and 15 judge models, complemented by a user study (n = 90), we demonstrate that judge models with an independent, non-fine-tuned relationship to the generator model provide the most reliable label flipping evaluations. Generator-judge relationships whose evaluations align closely with the user study also yield better model performance and robustness when used for CDA. Nevertheless, the gap between the most effective judge models and the results obtained from the user study remains considerably large, suggesting that a fully automated pipeline for CDA may be inadequate and requires human intervention.
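
The label-flipping metric itself is simple to state in code. A minimal sketch, where `judge` is a placeholder for any classifier (per the paper's finding, ideally a model independent of, and not fine-tuned from, the generator):

```python
# Label-flip rate: a counterfactual is valid only if the judge assigns it a
# label different from that of the paired original text.
from typing import Callable, Sequence

def label_flip_rate(
    originals: Sequence[str],
    counterfactuals: Sequence[str],
    judge: Callable[[str], int],
) -> float:
    """Fraction of generated counterfactuals whose judged label differs from
    the judged label of the paired original."""
    flips = sum(
        judge(cf) != judge(orig)
        for orig, cf in zip(originals, counterfactuals)
    )
    return flips / len(originals)

# Usage: label_flip_rate(texts, generated_cfs, judge=my_classifier), where
# my_classifier maps a string to a predicted class id.
```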


Enhancing Chemical Explainability Through Counterfactual Masking

Janisiów, Łukasz, Kochańczyk, Marek, Zieliński, Bartosz, Danel, Tomasz

arXiv.org Artificial Intelligence

Molecular property prediction is a crucial task that guides the design of new compounds, including drugs and materials. While explainable artificial intelligence methods aim to scrutinize model predictions by identifying influential molecular substructures, many existing approaches rely on masking strategies that remove either atoms or atom-level features to assess importance via fidelity metrics. These methods, however, often fail to adhere to the underlying molecular distribution and thus yield unintuitive explanations. In this work, we propose counterfactual masking, a novel framework that replaces masked substructures with chemically reasonable fragments sampled from generative models trained to complete molecular graphs. Rather than evaluating masked predictions against implausible zeroed-out baselines, we assess them relative to counterfactual molecules drawn from the data distribution. Our method offers two key benefits: (1) molecular realism underpinning robust and distribution-consistent explanations, and (2) meaningful counterfactuals that directly indicate how structural modifications may affect predicted properties. We demonstrate that counterfactual masking is well-suited for benchmarking model explainers and yields more actionable insights across multiple datasets and property prediction tasks.
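
A minimal sketch of the scoring idea under stated assumptions: `predict` and `sample_replacements` are hypothetical placeholders for a property-prediction model and a generative model that completes molecular graphs. The point is the counterfactual baseline in place of a zeroed-out one.

```python
# Sketch of counterfactual masking: instead of comparing against an
# implausible zeroed-out input, compare the model's prediction on the original
# molecule with its average prediction over chemically plausible completions.
import numpy as np

def counterfactual_importance(mol, substructure, predict, sample_replacements,
                              n_samples: int = 16) -> float:
    """Score `substructure` as f(mol) minus the expected prediction over
    counterfactual molecules in which it is replaced by sampled fragments."""
    baseline = predict(mol)
    counterfactuals = sample_replacements(mol, substructure, n_samples)
    return baseline - float(np.mean([predict(cf) for cf in counterfactuals]))
```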


Explainable Counterfactual Reasoning in Depression Medication Selection at Multi-Levels (Personalized and Population)

Qin, Xinyu, Chignell, Mark H., Greifenberger, Alexandria, Lokuge, Sachinthya, Toumeh, Elssa, Sternat, Tia, Katzman, Martin, Wang, Lu

arXiv.org Artificial Intelligence

Background: This study investigates how variations in Major Depressive Disorder (MDD) symptoms, quantified by the Hamilton Rating Scale for Depression (HAM-D), causally influence the prescription of SSRIs versus SNRIs. Methods: We applied explainable counterfactual reasoning with counterfactual explanations (CFs) to assess the impact of specific symptom changes on antidepressant choice. Results: Among 17 binary classifiers, Random Forest achieved the highest performance (accuracy, F1, precision, recall, and ROC-AUC all near 0.85). Sample-based CFs revealed both local and global feature importance of individual symptoms in medication selection. Conclusions: Counterfactual reasoning elucidates which MDD symptoms most strongly drive SSRI versus SNRI selection, enhancing the interpretability of AI-based clinical decision support systems. Future work should validate these findings on more diverse cohorts and refine the algorithms for clinical deployment.
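
As a toy illustration of sample-based counterfactual reasoning over binarized symptom features (synthetic data and made-up feature names, not the study's cohort or exact method):

```python
# Toy sketch: which single symptom flip changes a Random Forest's
# SSRI-vs-SNRI prediction for one patient? Data here are synthetic.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
X = rng.integers(0, 2, size=(500, 5))              # 5 binarized symptom items
y = ((X[:, 0] == 1) & (X[:, 3] == 0)).astype(int)  # synthetic prescription rule
clf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)

symptoms = ["insomnia", "anxiety", "fatigue", "somatic_pain", "low_mood"]
patient = X[:1].copy()
pred = clf.predict(patient)[0]

# Local counterfactual search: flip one symptom at a time and report which
# single changes would alter the predicted medication class.
for j, name in enumerate(symptoms):
    cf = patient.copy()
    cf[0, j] = 1 - cf[0, j]
    if clf.predict(cf)[0] != pred:
        print(f"Flipping '{name}' changes the predicted class from {pred}.")
```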