fooling
- South America > Paraguay > Asunción > Asunción (0.04)
- North America > Canada (0.04)
- Asia > South Korea > Gyeonggi-do > Suwon (0.04)
Review for NeurIPS paper: Intra Order-preserving Functions for Calibration of Multi-Class Neural Networks
Additional Feedback: The proposed methods perform very strongly in ECE, slightly better than the state of the art in NLL, and slightly worse in classwise-ECE. It would be good to have some explanation of why ECE and classwise-ECE give such different results. Since ECE studies the calibration of only the class with the highest predicted probability and ignores the other class probabilities, does this mean that the proposed method is better than the state of the art on the top-1 probability but slightly weaker on the other classes? In the appendix provided as supplemental material, at lines 739-742 it is claimed that ECE does not suffer from the same problem that is highlighted about classwise-ECE at lines 731-738. While this is technically correct, it misses the point: ECE in fact suffers from essentially the same problem.
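The distinction the reviewer draws can be made concrete: top-1 ECE bins predictions by their maximum predicted probability and averages the gap between each bin's accuracy and its mean confidence. A minimal sketch (the binning scheme is the common equal-width one; the toy numbers are illustrative, not from the paper):

```python
def expected_calibration_error(confidences, correct, n_bins=15):
    """Top-1 ECE: bin predictions by confidence and average the
    |accuracy - confidence| gap over bins, weighted by bin size."""
    n = len(confidences)
    ece = 0.0
    for b in range(n_bins):
        lo, hi = b / n_bins, (b + 1) / n_bins
        idx = [i for i, c in enumerate(confidences) if lo < c <= hi]
        if idx:
            acc = sum(correct[i] for i in idx) / len(idx)
            avg_conf = sum(confidences[i] for i in idx) / len(idx)
            ece += len(idx) / n * abs(acc - avg_conf)
    return ece

# Toy example: an overconfident classifier's top-1 confidences
conf = [0.9, 0.95, 0.85, 0.99]
correct = [1, 0, 1, 1]
ece = expected_calibration_error(conf, correct)
```

Because only the top-1 probability enters the computation, a model can score well on ECE while the remaining class probabilities are poorly calibrated, which is exactly the asymmetry the reviewer asks about.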
Attack to Fool and Explain Deep Networks
Akhtar, Naveed, Jalwana, Muhammad A. A. K., Bennamoun, Mohammed, Mian, Ajmal
Deep visual models are susceptible to adversarial perturbations of their inputs. Although these signals are carefully crafted, they still appear as noise-like patterns to humans. This observation has led to the argument that deep visual representations are misaligned with human perception. We counter-argue by providing evidence of human-meaningful patterns in adversarial perturbations. We first propose an attack that fools a network into confusing a whole category of objects (the source class) with a target label. Our attack also limits unintended fooling by samples from non-source classes, thereby circumscribing human-defined semantic notions for network fooling. We show that the proposed attack not only leads to the emergence of regular geometric patterns in the perturbations, but also reveals insightful information about the decision boundaries of deep models. Exploring this phenomenon further, we alter the `adversarial' objective of our attack to use it as a tool to `explain' deep visual representations. We show that by careful channeling and projection of the perturbations computed by our method, we can visualize a model's understanding of human-defined semantic notions. Finally, we exploit the explainability properties of our perturbations to perform image generation, inpainting, and interactive image manipulation by attacking adversarially robust `classifiers'. In all, our major contribution is a novel pragmatic adversarial attack that is subsequently transformed into a tool to interpret visual models. The article also makes secondary contributions by establishing the utility of our attack beyond the adversarial objective through multiple interesting applications.
- Education > Educational Setting > Higher Education (0.46)
- Information Technology > Security & Privacy (0.36)
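The two-part objective the abstract describes — pushing source-class samples toward the target label while limiting unintended fooling of non-source samples — can be sketched as a combined loss. This is an illustrative reconstruction, not the paper's exact formulation; the function name, the equal weighting of the two terms, and the cross-entropy form of each term are assumptions:

```python
import math

def softmax(logits):
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    s = sum(exps)
    return [e / s for e in exps]

def fooling_loss(logits_src, logits_non, target, labels_non):
    """Hypothetical attack objective in the spirit of the paper: drive
    source-class samples toward the target label while keeping
    non-source samples on their original labels."""
    # term 1: maximize target probability on source-class samples
    src_term = -sum(math.log(softmax(z)[target]) for z in logits_src) / len(logits_src)
    # term 2: preserve original labels on non-source samples
    # (this is what limits unintended fooling)
    non_term = -sum(math.log(softmax(z)[y]) for z, y in zip(logits_non, labels_non)) / len(logits_non)
    return src_term + non_term
```

A perturbation minimizing such a loss is rewarded only for flipping the source class, which is how the attack circumscribes a human-defined semantic notion rather than fooling indiscriminately.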
Fooling Neural Network Interpretations via Adversarial Model Manipulation
Heo, Juyeon, Joo, Sunghwan, Moon, Taesup
We ask whether neural network interpretation methods can be fooled via adversarial model manipulation, which we define as a model fine-tuning step that aims to radically alter the explanations without hurting the accuracy of the original model. By incorporating the interpretation results directly into the regularization term of the fine-tuning objective, we show that state-of-the-art interpreters, e.g., LRP and Grad-CAM, can be easily fooled by our model manipulation. We propose two types of fooling, passive and active, and demonstrate that such fooling generalizes well to the entire validation set and transfers to other interpretation methods. Our results are validated both by visually showing the fooled explanations and by reporting quantitative metrics that measure the deviations from the original explanations. We argue that the stability of a neural network interpretation method with respect to our adversarial model manipulation is an important criterion to check when developing robust and reliable interpretation methods.
- North America > United States (0.14)
- Europe (0.14)
- Asia > South Korea > Gyeonggi-do > Suwon (0.04)
- Research Report > New Finding (0.48)
- Research Report > Promising Solution (0.34)
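The fine-tuning objective the abstract describes — the task loss plus a regularizer computed from the interpretation itself — can be sketched as follows. All names here are hypothetical, and the penalty is only one plausible instance of the passive-fooling idea (pushing attribution mass off the originally salient region), not the paper's exact formulation:

```python
def manipulation_objective(task_loss, heatmap, original_mask, lam=1.0):
    """Hypothetical fine-tuning objective: keep the task loss low while
    penalizing attribution mass that remains on the originally salient
    region. `heatmap` is a flat list of attribution values and
    `original_mask` a 0/1 list marking that region (illustrative names)."""
    total = sum(abs(h) for h in heatmap) or 1.0
    # fraction of (normalized) attribution still on the old region
    mass_on_region = sum(abs(h) * m for h, m in zip(heatmap, original_mask)) / total
    return task_loss + lam * mass_on_region
```

Minimizing such an objective during fine-tuning changes where the explanation points without requiring the predictions themselves to change, which is why accuracy can be preserved while the interpretation is radically altered.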
Denoising Autoencoders for Overgeneralization in Neural Networks
Despite the recent developments that allowed neural networks to achieve impressive performance on a variety of applications, these models are intrinsically affected by the problem of overgeneralization, due to their partitioning of the full input space into the fixed set of target classes used during training. Thus it is possible for novel inputs belonging to categories unknown during training, or even completely unrecognizable to humans, to fool the system into classifying them as one of the known classes, even with a high degree of confidence. Solving this problem may help improve the security of such systems in critical applications, and may further lead to applications in the context of open set recognition and one-class recognition. This paper presents a novel way to compute a confidence score using denoising autoencoders and shows that such a confidence score can correctly identify the regions of the input space close to the training distribution by approximately identifying its local maxima.
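A reconstruction-error-based confidence score of the kind the abstract describes might look like the following sketch. The Gaussian mapping from error to score, the function names, and the toy denoiser are all assumptions for illustration, not the paper's exact construction:

```python
import math

def dae_confidence(x, denoise, scale=1.0):
    """Hypothetical confidence score: a denoising autoencoder reconstructs
    inputs near the training distribution almost perfectly, so a small
    reconstruction error maps to a score near 1, while inputs far from
    the data reconstruct poorly and score near 0. `denoise` stands in
    for a trained DAE's reconstruction function."""
    r = denoise(x)
    err2 = sum((xi - ri) ** 2 for xi, ri in zip(x, r))
    return math.exp(-err2 / scale)

# Toy denoiser that pulls inputs toward the origin, standing in for a
# DAE trained on data centered at zero.
denoise = lambda x: [0.9 * xi for xi in x]
near = dae_confidence([0.1, 0.0], denoise)  # close to the "training data"
far = dae_confidence([5.0, 5.0], denoise)   # far from it
```

The intuition matches the abstract: the DAE's reconstruction pulls points toward high-density regions of the training distribution, so small displacement signals proximity to a local maximum of that distribution.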
Fooling all the people all the time: the rise of artificial intelligence and fake news
Modern artificial intelligence is way beyond playing chess; it has mastered Go and kicks butt in Dota 2, among other games. What started as a test-lab monkey has evolved into something akin to a prodigy child. Artificial intelligence, or AI, may still have to be fed information, but once it has gathered enough, it can come up with results that mimic the original data. First came the static images -- AI managed to create perfectly convincing images of people who have never existed. Then it showed it was perfectly capable of mimicking different seasons.
- Media > News (1.00)
- Education > Health & Safety > School Safety & Security > School Violence (0.33)
Deep Learning can be easily fooled
In a post I wrote last year, I talked about the fact that a Deep Neural Network could not label a slightly changed image correctly. Recently, a related result was shown by researchers from the University of Wyoming and Cornell University. They produced images completely unrecognizable to human eyes (as shown in the right picture) while DNNs still label them as familiar objects (such as cheetah/peacock/baseball/…) with 99.99% confidence. The researchers used one of the best Deep Neural Networks, "AlexNet", trained on the 1.3-million-image ILSVRC 2012 ImageNet dataset to achieve state-of-the-art performance, and the "LeNet" model trained on the MNIST dataset to test whether the result holds for other DNN architectures. "AlexNet" and "LeNet" are both provided by the Caffe software package.