AITopics

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.63)

Neural Information Processing Systems, Aug-16-2025, 20:01:05 GMT

Explanation-based Data Augmentation for Image Classification

artificial intelligence, image understanding, machine learning

Country:

Asia > Singapore (0.04)
North America > United States > California (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Neural Information Processing Systems, Aug-14-2025, 06:11:32 GMT

2955_3db_a_framework_for_debugging_

Figure 16: Screenshot of the dashboard used for data exploration. Since experiments usually produce large amounts of data that can be hard to get a sense of, we created a data visualization dashboard. Given a folder containing the JSON logs of a job, it offers a user interface to explore the influence of the controls. For each parameter of each control, we can pick one of three modes. Heat map axis: this control will be used as the x or y axis of the heat map. Exactly two controls should be assigned to this mode to enable the visualization.
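The heat map mode described in the caption can be pictured as a pivot over the logged results. The following is a minimal sketch, not the dashboard's actual code: the record fields (`rotation`, `zoom`, `is_correct`) and the function name `heatmap_grid` are hypothetical, and parsing the JSON logs is omitted because the log schema is not given here.

```python
from collections import defaultdict

def heatmap_grid(records, x_control, y_control, metric="is_correct"):
    """Average `metric` over all records sharing each (x, y) control pair."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for rec in records:
        key = (rec[x_control], rec[y_control])
        sums[key] += float(rec[metric])
        counts[key] += 1
    xs = sorted({key[0] for key in counts})
    ys = sorted({key[1] for key in counts})
    # One row per y value, one column per x value; None marks empty cells.
    grid = [
        [sums[(x, y)] / counts[(x, y)] if (x, y) in counts else None for x in xs]
        for y in ys
    ]
    return xs, ys, grid

# Hypothetical records, as they might look after loading a job's logs.
records = [
    {"rotation": 0, "zoom": 1.0, "is_correct": 1},
    {"rotation": 0, "zoom": 1.0, "is_correct": 0},
    {"rotation": 0, "zoom": 2.0, "is_correct": 1},
    {"rotation": 90, "zoom": 1.0, "is_correct": 0},
]
xs, ys, grid = heatmap_grid(records, "rotation", "zoom")
```

Exactly two controls are passed as axes, matching the caption's requirement that two controls be assigned to the heat map mode.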

experiment, robustness, vision model

Country: Europe > Germany > North Rhine-Westphalia > Düsseldorf Region > Düsseldorf (0.04)

Industry: Leisure & Entertainment (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.96)
Information Technology > Visualization (0.75)
Information Technology > Artificial Intelligence > Vision (0.71)

Neural Information Processing Systems, May-27-2025, 06:24:52 GMT

SLIM: Style-Linguistics Mismatch Model for Generalized Audio Deepfake Detection

Audio deepfake detection (ADD) is crucial to combat the misuse of speech synthesized by generative AI models. Existing ADD models suffer from generalization issues to unseen attacks, with a large performance discrepancy between in-domain and out-of-domain data. Moreover, the black-box nature of existing models limits their use in real-world scenarios, where explanations are required for model decisions. To alleviate these issues, we introduce a new ADD model that explicitly uses the Style-LInguistics Mismatch (SLIM) in fake speech to separate them from real speech. SLIM first employs self-supervised pretraining on only real samples to learn the style-linguistics dependency in the real class.

generalized audio deepfake detection, slim, style-linguistics mismatch model

Industry: Information Technology > Security & Privacy (0.65)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.99)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.65)

arXiv.org Artificial Intelligence, Dec-23-2024

Concept Complement Bottleneck Model for Interpretable Medical Image Diagnosis

Wang, Hongmei, Hou, Junlin, Chen, Hao

Models based on human-understandable concepts have received extensive attention to improve model interpretability for trustworthy artificial intelligence in the field of medical image analysis. These methods can provide convincing explanations for model decisions but heavily rely on the detailed annotation of pre-defined concepts. Consequently, they may not be effective in cases where concepts or annotations are incomplete or low-quality. Although some methods automatically discover new, effective visual concepts rather than using pre-defined ones, or find human-understandable concepts via large language models, they are prone to veering away from medical diagnostic evidence and are challenging to understand. In this paper, we propose a concept complement bottleneck model for interpretable medical image diagnosis with the aim of complementing the existing concept set and finding new concepts bridging the gap between explainable models. Specifically, we propose to use concept adapters for specific concepts to mine the concept differences and score concepts in their own attention channels to support nearly fair concept learning. Then, we devise a concept complement strategy to learn new concepts while jointly using known concepts to improve model performance. Comprehensive experiments on medical datasets demonstrate that our model outperforms the state-of-the-art competitors in concept detection and disease diagnosis tasks while providing diverse explanations to ensure model interpretability effectively.

artificial intelligence, machine learning, natural language

2410.15446

Country: Asia > China > Hong Kong (0.05)

Genre: Research Report (1.00)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)

arXiv.org Artificial Intelligence, Oct-3-2024

PCEvE: Part Contribution Evaluation Based Model Explanation for Human Figure Drawing Assessment and Beyond

Lee, Jongseo, Ahn, Geo, Kim, Seong Tae, Choi, Jinwoo

For automatic human figure drawing (HFD) assessment tasks, such as diagnosing autism spectrum disorder (ASD) using HFD images, the clarity and explainability of a model decision are crucial. Existing pixel-level attribution-based explainable AI (XAI) approaches demand considerable effort from users to interpret the semantic information of a region in an image, which can often be time-consuming and impractical. To overcome this challenge, we propose a part contribution evaluation based model explanation (PCEvE) framework. On top of the part detection, we measure the Shapley value of each individual part to evaluate its contribution to a model decision. Unlike existing attribution-based XAI approaches, the PCEvE provides a straightforward explanation of a model decision, i.e., a part contribution histogram. Furthermore, the PCEvE expands the scope of explanations beyond the conventional sample-level to include class-level and task-level insights, offering a richer, more comprehensive understanding of model behavior. We rigorously validate the PCEvE via extensive experiments on multiple HFD assessment datasets. Also, we sanity-check the proposed method with a set of controlled experiments. Additionally, we demonstrate the versatility and applicability of our method to other domains by applying it to a photo-realistic dataset, the Stanford Cars.
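The part-level Shapley computation mentioned in the abstract can be illustrated with a small exact calculation. This is a minimal sketch, not the PCEvE implementation: the part names and the `toy_score` function are hypothetical stand-ins for a model's output when only a subset of detected parts is visible, and exact enumeration is only practical for a handful of parts.

```python
from itertools import combinations
from math import factorial

def shapley_values(parts, score):
    """Exact Shapley value of each part for a set function `score`."""
    n = len(parts)
    values = {}
    for part in parts:
        others = [q for q in parts if q != part]
        total = 0.0
        for k in range(n):
            # Weight shared by every coalition of size k that excludes `part`.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                coalition = frozenset(subset)
                # Marginal contribution of adding `part` to the coalition.
                total += weight * (score(coalition | {part}) - score(coalition))
        values[part] = total
    return values

def toy_score(visible):
    """Hypothetical model output given the set of visible parts."""
    s = 0.0
    if "head" in visible:
        s += 0.4
    if "arms" in visible:
        s += 0.1
    if "head" in visible and "eyes" in visible:
        s += 0.3  # interaction: eyes only matter when the head is visible
    return s

PARTS = ["head", "eyes", "arms"]
contributions = shapley_values(PARTS, toy_score)
```

The resulting dictionary plays the role of a part contribution histogram for one sample: the values sum to the difference between the score with all parts visible and the score with none, and the interaction term is split evenly between the two parts involved.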

machine learning, natural language, pceve

2409.1826

Genre:

Research Report > Experimental Study (0.68)
Research Report > New Finding (0.46)

Industry: Health & Medicine > Therapeutic Area > Neurology > Autism (0.54)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

arXiv.org Artificial Intelligence, Sep-25-2024

Claim-Guided Textual Backdoor Attack for Practical Applications

Song, Minkyoo, Kim, Hanna, Kim, Jaehan, Jin, Youngjin, Shin, Seungwon

Recent advances in natural language processing and the increased use of large language models have exposed new security vulnerabilities, such as backdoor attacks. Previous backdoor attacks require input manipulation after model distribution to activate the backdoor, posing limitations in real-world applicability. Addressing this gap, we introduce a novel Claim-Guided Backdoor Attack (CGBA), which eliminates the need for such manipulations by utilizing inherent textual claims as triggers. CGBA leverages claim extraction, clustering, and targeted training to trick models into misbehaving on targeted claims without affecting their performance on clean data. CGBA demonstrates its effectiveness and stealthiness across various datasets and models, significantly enhancing the feasibility of practical backdoor attacks. Our code and data will be available at https://github.com/PaperCGBA/CGBA.

backdoor, backdoor attack, dataset

2409.16618

Country:

North America > United States (0.28)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Asia > South Korea (0.04)
Asia > Nepal (0.04)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Immunology (0.49)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Latorre, Laura, Petrychenko, Liliana, Beets-Tan, Regina, Kopytova, Taisiya, Silva, Wilson

Towards Case-based Interpretability for Medical Federated Learning

arXiv.org Artificial Intelligence, Aug-24-2024

Case-based interpretability is vital in explaining medical Artificial Intelligence (AI) model decisions. Generating explanations for AI model decisions is paramount to increasing trust and allowing widespread adoption in clinical practice [1]. We can find several approaches to producing explanations in the scientific literature, from saliency maps (highlighting image pixels driving the decision) to textual [...]

Even though federated learning's potential to overcome some of the current AI flaws is currently widely recognized, it also introduces new challenges. The decentralized nature of federated learning guarantees compliance with privacy regulations but, at the same time, inhibits data access and inspection [7]. Non-accessible data means that identifying bugs or detecting biases is impossible following conventional approaches. The same is true for case-based explainability.

case-based explanation, dataset, generative model

2408.13626

Country:

Europe > Netherlands > North Holland > Amsterdam (0.05)
Europe > Netherlands > Limburg > Maastricht (0.05)
South America > Peru > Lima Department > Lima Province > Lima (0.04)

Genre: Research Report (0.50)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Health & Medicine > Nuclear Medicine (0.99)
Health & Medicine > Therapeutic Area (0.71)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Case-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Memory-Based Learning (0.87)

Pennisi, Matteo, Bellitto, Giovanni, Palazzo, Simone, Shah, Mubarak, Spampinato, Concetto

Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models

arXiv.org Artificial Intelligence, Apr-3-2024

We present DiffExplainer, a novel framework that, leveraging language-vision models, enables multimodal global explainability. DiffExplainer employs diffusion models conditioned on optimized text prompts, synthesizing images that maximize class outputs and hidden features of a classifier, thus providing a visual tool for explaining decisions. Moreover, the analysis of generated visual descriptions allows for automatic identification of biases and spurious features, as opposed to traditional methods that often rely on manual intervention. The cross-modal transferability of language-vision models also enables the possibility to describe decisions in a more human-interpretable way, i.e., through text. We conduct comprehensive experiments, which include an extensive user study, demonstrating the effectiveness of DiffExplainer on 1) the generation of high-quality images explaining model decisions, surpassing existing activation maximization methods, and 2) the automated identification of biases and spurious features.

diffexplainer, international conference, spurious feature

2404.02618

Country:

Europe > Italy (0.05)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Florida (0.04)
Europe > France (0.04)

Genre:

Research Report > Experimental Study (0.68)
Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (0.48)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial Intelligence, Feb-12-2024

CMA-R: Causal Mediation Analysis for Explaining Rumour Detection

Tian, Lin, Zhang, Xiuzhen, Lau, Jey Han

We apply causal mediation analysis to explain the decision-making process of neural models for rumour detection on Twitter. Interventions at the input and network level reveal the causal impacts of tweets and words on the model output. We find that our approach CMA-R -- Causal Mediation Analysis for Rumour detection -- identifies salient tweets that explain model predictions and shows strong agreement with human judgements for critical tweets determining the truthfulness of stories. CMA-R can further highlight causally impactful words in the salient tweets, providing another layer of interpretability and transparency into these blackbox rumour detection systems. Code is available at: https://github.com/ltian678/cma-r.
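The idea of an input-level intervention can be sketched by removing one tweet at a time and measuring how the model output changes. This is a simplified illustration only: it measures the total effect of an input ablation on a toy scoring function and does not cover the network-level interventions or the mediation analysis itself, and it is not the authors' code. The `toy_rumour_prob` model, its cue words, and the example thread are hypothetical.

```python
import math

def toy_rumour_prob(tweets):
    """Hypothetical stand-in for a rumour detector: logistic over cue words."""
    cues = {"unconfirmed": 1.5, "hoax": 2.0, "official": -1.0}
    z = sum(w for t in tweets for word, w in cues.items() if word in t.lower())
    return 1.0 / (1.0 + math.exp(-z))

def tweet_effects(tweets, model):
    """Change in model output when each tweet is removed from the thread."""
    base = model(tweets)
    effects = []
    for i in range(len(tweets)):
        ablated = tweets[:i] + tweets[i + 1:]  # intervention: drop tweet i
        effects.append(base - model(ablated))
    return effects

thread = [
    "Breaking: unconfirmed reports of a fire",
    "This is a hoax",
    "Nice weather today",
]
effects = tweet_effects(thread, toy_rumour_prob)
# Index of the tweet whose removal changes the output the most.
most_salient = max(range(len(thread)), key=lambda i: abs(effects[i]))
```

Ranking tweets by the size of this effect gives a simple notion of salience; the same ablation could be applied to individual words within a salient tweet.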

cma-r, source tweet, tweet