AITopics | salient region

SageMix: Saliency-Guided Mixup for Point Clouds

Data augmentation is key to improving the generalization ability of deep learning models. Mixup is a simple and widely used data augmentation technique that has proven effective in alleviating overfitting and data scarcity. Recent studies of saliency-aware Mixup in the image domain also show that preserving discriminative parts improves generalization performance. However, these Mixup-based data augmentations are underexplored in 3D vision, especially in point clouds. In this paper, we propose SageMix, a saliency-guided Mixup for point clouds to preserve salient local structures.
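For readers unfamiliar with the baseline the abstract builds on, here is a minimal sketch of plain (vanilla) Mixup, which blends two training examples and their one-hot labels with a coefficient drawn from a Beta distribution. It is not SageMix: the saliency-guided selection of local point-cloud structures is not reproduced here. The function name `mixup`, the `alpha` default, and the list-based inputs are assumptions made for illustration.

```python
import random


def mixup(x_a, y_a, x_b, y_b, alpha=1.0, rng=None):
    """Vanilla Mixup on two examples (illustrative sketch, not SageMix).

    x_a, x_b: feature vectors of equal length (lists of floats).
    y_a, y_b: one-hot (or soft) label vectors of equal length.
    alpha:    Beta(alpha, alpha) concentration; assumed default of 1.0.
    """
    rng = rng or random.Random()
    # Mixing coefficient lambda ~ Beta(alpha, alpha), so 0 <= lam <= 1.
    lam = rng.betavariate(alpha, alpha)
    # Convex combination of inputs and of labels with the same lambda.
    x_mix = [lam * a + (1.0 - lam) * b for a, b in zip(x_a, x_b)]
    y_mix = [lam * a + (1.0 - lam) * b for a, b in zip(y_a, y_b)]
    return x_mix, y_mix, lam


if __name__ == "__main__":
    x, y, lam = mixup([1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0],
                      rng=random.Random(0))
    print(lam, x, y)
```

Because the same coefficient mixes inputs and labels, the mixed label stays a valid probability vector whenever both source labels are.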

name change, sagemix, saliency-guided mixup, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.60)

Certified but Fooled! Breaking Certified Defences with Ghost Certificates

Vo, Quoc Viet, Haq, Tashreque M., Montague, Paul, Abraham, Tamas, Abbasnejad, Ehsan, Ranasinghe, Damith C.

arXiv.org Artificial Intelligence, Nov-19-2025

Certified defenses promise provable robustness guarantees. We study the malicious exploitation of probabilistic certification frameworks to better understand the limits of guarantee provisions. Now, the objective is not only to mislead a classifier but also to manipulate the certification process into generating a robustness guarantee for an adversarial input: certificate spoofing. A recent study in ICLR demonstrated that crafting large perturbations can shift inputs far into regions capable of generating a certificate for an incorrect class. Our study investigates whether the perturbations needed to cause a misclassification, and yet coax a certified model into issuing a deceptive, large robustness radius for a target class, can still be made small and imperceptible. We explore the idea of region-focused adversarial examples to craft imperceptible perturbations, spoof certificates, and achieve certification radii larger than those of the source class: ghost certificates. Extensive evaluations on ImageNet demonstrate the ability to effectively bypass state-of-the-art certified defenses such as DensePure. Our work underscores the need to better understand the limits of robustness certification methods.

artificial intelligence, machine learning, perturbation, (18 more...)

2511.14003

Genre: Research Report > New Finding (0.68)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Data Science (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Beyond saliency: enhancing explanation of speech emotion recognition with expert-referenced acoustic cues

Nasr, Seham, Ren, Zhao, Johnson, David

arXiv.org Artificial Intelligence, Nov-18-2025

Explainable AI (XAI) for Speech Emotion Recognition (SER) is critical for building transparent, trustworthy models. Current saliency-based methods, adapted from vision, highlight spectrogram regions but fail to show whether these regions correspond to meaningful acoustic markers of emotion, limiting faithfulness and interpretability. We propose a framework that overcomes these limitations by quantifying the magnitudes of cues within salient regions. This clarifies "what" is highlighted and connects it to "why" it matters, linking saliency to expert-referenced acoustic cues of speech emotions. Experiments on benchmark SER datasets show that our approach improves explanation quality by explicitly linking salient regions to theory-driven, expert-referenced acoustic cues of speech emotion. Compared to standard saliency methods, it provides more understandable and plausible explanations of SER models, offering a foundational step towards trustworthy speech-based affective computing.

artificial intelligence, emotion, machine learning, (16 more...)

2511.11691

Country: Europe > Germany (0.28)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.94)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science > Emotion (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

MRFD: Multi-Region Fusion Decoding with Self-Consistency for Mitigating Hallucinations in LVLMs

Ge, Haonan, Wang, Yiwei, Yang, Ming-Hsuan, Cai, Yujun

arXiv.org Artificial Intelligence, Oct-14-2025

Large Vision-Language Models (LVLMs) have shown strong performance across multimodal tasks. However, they often produce hallucinations (text that is inconsistent with the visual input) because of their limited ability to verify information in different regions of the image. To address this, we propose Multi-Region Fusion Decoding (MRFD), a training-free decoding method that improves factual grounding by modeling inter-region consistency. MRFD identifies salient regions using cross-attention, generates initial responses for each, and computes reliability weights based on Jensen-Shannon Divergence (JSD) among the responses. These weights guide a consistency-aware fusion of per-region predictions, using region-aware prompts inspired by Chain-of-Thought reasoning. Experiments across multiple LVLMs and benchmarks show that MRFD significantly reduces hallucinations and improves response factuality without requiring model updates.
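The JSD-based weighting described in the abstract can be illustrated with a small sketch. The abstract does not give the exact weighting or fusion formula, so everything specific below is an assumption: each region's prediction is represented as a discrete distribution, a region's weight is a softmax over its negative mean JSD to the other regions, and fusion is a weighted average. The names `reliability_weights`, `fuse`, and `temperature` are hypothetical and are not taken from MRFD.

```python
import math


def kl(p, q):
    """KL divergence KL(p || q) in nats; terms with p_i == 0 contribute 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def jsd(p, q):
    """Jensen-Shannon divergence: symmetric and bounded by ln 2."""
    m = [(pi + qi) / 2.0 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def reliability_weights(dists, temperature=1.0):
    """Assumed rule: regions that agree with the others get larger weights.

    dists: list of at least two per-region probability distributions.
    """
    n = len(dists)
    mean_div = [
        sum(jsd(dists[i], dists[j]) for j in range(n) if j != i) / (n - 1)
        for i in range(n)
    ]
    scores = [math.exp(-d / temperature) for d in mean_div]
    total = sum(scores)
    return [s / total for s in scores]


def fuse(dists, weights):
    """Weighted average of the per-region distributions (assumed fusion)."""
    size = len(dists[0])
    return [sum(w * d[k] for w, d in zip(weights, dists)) for k in range(size)]


if __name__ == "__main__":
    regions = [[0.7, 0.2, 0.1], [0.6, 0.3, 0.1], [0.1, 0.1, 0.8]]
    w = reliability_weights(regions)
    print(w, fuse(regions, w))
```

In this toy input the third region disagrees with the other two, so it receives the smallest weight and contributes least to the fused distribution.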

large language model, machine learning, natural language, (19 more...)

2508.10264

Country: North America > United States (1.00)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.46)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.46)

9543942c237ded1b39b1fd37259ff88e-Paper-Conference.pdf

Neural Information Processing Systems, Aug-17-2025, 02:36:14 GMT

artificial intelligence, machine learning, point cloud, (19 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)
Information Technology > Artificial Intelligence > Vision (0.68)

Semantically Informed Salient Regions Guided Radiology Report Generation

Hou, Zeyi, Wei, Zeqiang, Yan, Ruixin, Lang, Ning, Zhou, Xiuzhuang

arXiv.org Artificial Intelligence, Jul-16-2025

Recent advances in automated radiology report generation from chest X-rays using deep learning algorithms have the potential to significantly reduce the arduous workload of radiologists. However, due to the inherent massive data bias in radiology images, where abnormalities are typically subtle and sparsely distributed, existing methods often produce fluent yet medically inaccurate reports, limiting their applicability in clinical practice. To address this issue effectively, we propose a Semantically Informed Salient Regions-guided (SISRNet) report generation method. Specifically, our approach explicitly identifies salient regions with medically critical characteristics using fine-grained cross-modal semantics. Then, SISRNet systematically focuses on these high-information regions during both image modeling and report generation, effectively capturing subtle abnormal findings, mitigating the negative impact of data bias, and ultimately generating clinically accurate reports. Compared to its peers, SISRNet demonstrates superior performance on the widely used IU-Xray and MIMIC-CXR datasets. Chest radiography is currently the most widely used medical imaging examination, playing a crucial role in clinical diagnosis [1] and epidemiological studies [2].

artificial intelligence, machine learning, natural language, (18 more...)

2507.11015

Country: Asia > China (0.14)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Nuclear Medicine (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)
