AITopics | resilience

50abc3e730e36b387ca8e02c26dc0a22-Paper.pdf

Neural Information Processing SystemsApr-25-2026, 21:27:28 GMT

abnormal feature, data mining, machine learning, (18 more...)

Neural Information Processing Systems

Country: North America > United States > Michigan (0.28)

Industry: Information Technology (0.47)

Technology:

Information Technology > Data Science > Data Mining (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Communications (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Towards Verified and Targeted Explanations through Formal Methods

Wang, Hanchen David, Lopez, Diego Manzanas, Robinette, Preston K., Oguz, Ipek, Johnson, Taylor T., Ma, Meiyi

arXiv.org Machine LearningApr-17-2026

As deep neural networks are deployed in safety-critical domains such as autonomous driving and medical diagnosis, stakeholders need explanations that are interpretable but also trustworthy with formal guarantees. Existing XAI methods fall short: heuristic attribution techniques (e.g., LIME, Integrated Gradients) highlight influential features but offer no mathematical guarantees about decision boundaries, while formal methods verify robustness yet remain untargeted, analyzing the nearest boundary regardless of whether it represents a critical risk. In safety-critical systems, not all misclassifications carry equal consequences; confusing a "Stop" sign for a "60 kph" sign is far more dangerous than confusing it with a "No Passing" sign. We introduce ViTaX (Verified and Targeted Explanations), a formal XAI framework that generates targeted semifactual explanations with mathematical guarantees. For a given input (class y) and a user-specified critical alternative (class t), ViTaX: (1) identifies the minimal feature subset most sensitive to the y->t transition, and (2) applies formal reachability analysis to guarantee that perturbing these features by epsilon cannot flip the classification to t. We formalize this through Targeted epsilon-Robustness, certifying whether a feature subset remains robust under perturbation toward a specific target class. ViTaX is the first method to provide formally guaranteed explanations of a model's resilience against user-identified alternatives. Evaluations on MNIST, GTSRB, EMNIST, and TaxiNet demonstrate over 30% fidelity improvement with minimal explanation cardinality.

artificial intelligence, machine learning, publicationdate, (18 more...)

arXiv.org Machine Learning

2604.14209

Country:

North America > United States > Tennessee > Davidson County > Nashville (0.05)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
Europe > Portugal > Porto > Porto (0.04)
(3 more...)

Genre: Research Report (0.50)

Industry: Transportation > Ground > Road (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.86)

Add feedback

Not all naked mole-rat queens go out in a blaze of bloody violence

Surprising study reveals peaceful succession is possible. More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results. Naked mole-rats are among the only eusocial mammals. Breakthroughs, discoveries, and DIY tips sent six days a week. Queen bees may get most of the glory, but there is another queen of the animal kingdom who is the linchpin of her entire society.

artificial intelligence, queen, succession, (11 more...)

Popular Science

Country: Africa > East Africa (0.05)

Genre: Research Report > New Finding (0.51)

Technology: Information Technology > Artificial Intelligence (0.35)

Add feedback

ADD for Multi-Bit Image Watermarking

Luo, An, Ding, Jie

arXiv.org Machine LearningApr-14-2026

As generative models enable rapid creation of high-fidelity images, societal concerns about misinformation and authenticity have intensified. A promising remedy is multi-bit image watermarking, which embeds a multi-bit message into an image so that a verifier can later detect whether the image is generated by someone and further identify the source by decoding the embedded message. Existing approaches often fall short in capacity, resilience to common image distortions, and theoretical justification. To address these limitations, we propose ADD (Add, Dot, Decode), a multi-bit image watermarking method with two stages: learning a watermark to be linearly combined with the multi-bit message and added to the image, and decoding through inner products between the watermarked image and the learned watermark. On the standard MS-COCO benchmark, we demonstrate that for the challenging task of 48-bit watermarking, ADD achieves 100\% decoding accuracy, with performance dropping by at most 2\% under a wide range of image distortions, substantially smaller than the 14\% average drop of state-of-the-art methods. In addition, ADD achieves substantial computational gains, with 2-fold faster embedding and 7.4-fold faster decoding than the fastest existing method. We further provide a theoretical analysis explaining why the learned watermark and the corresponding decoding rule are effective.

large language model, machine learning, natural language, (21 more...)

arXiv.org Machine Learning

2604.11491

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Minnesota (0.04)
Europe > Netherlands (0.04)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
(3 more...)

Add feedback

ProTransformer: Robustify Transformers via Plug-and-Play Paradigm

Neural Information Processing SystemsMar-22-2026, 21:46:53 GMT

Transformer-based architectures have dominated various areas of machine learning in recent years. In this paper, we introduce a novel robust attention mechanism designed to enhance the resilience of transformer-based architectures. Crucially, this technique can be integrated into existing transformers as a plug-and-play layer, improving their robustness without the need for additional training or fine-tuning. Through comprehensive experiments and ablation studies, we demonstrate that our ProTransformer significantly enhances the robustness of transformer models across a variety of prediction tasks, attack mechanisms, backbone architectures, and data domains. Notably, without further fine-tuning, the ProTransformer consistently improves the performance of vanilla transformers by 19.5\%, 28.3\%, 16.1\%, and 11.4\% for BERT, ALBERT, DistilBERT, and RoBERTa, respectively, under the classical TextFooler attack. Furthermore, ProTransformer shows promising resilience in large language models (LLMs) against prompting-based attacks, improving the performance of T5 and LLaMA by 24.8\% and 17.8\%, respectively, and enhancing Vicuna by an average of 10.4\% against the Jailbreaking attack. Beyond the language domain, ProTransformer also demonstrates outstanding robustness in both vision and graph domains.

large language model, machine learning, natural language, (10 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

bf64451da212313c5ef1a00f49232c47-Paper-Conference.pdf

Neural Information Processing SystemsFeb-16-2026, 21:42:49 GMT

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
North America > Canada (0.04)
Europe > Spain (0.04)
(3 more...)

Industry: Information Technology > Security & Privacy (0.48)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.93)
(3 more...)

Add feedback

79358587d84628728199059f648824e6-Paper-Conference.pdf

Neural Information Processing SystemsFeb-15-2026, 23:57:19 GMT

artificial intelligence, graph, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > Illinois > Cook County > Chicago (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(5 more...)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

wherex

Neural Information Processing SystemsFeb-8-2026, 15:54:24 GMT

Graph neural networks (GNNs) have shown the power in graph representation learningfornumeroustasks.

artificial intelligence, machine learning, xin, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > New Jersey > Essex County > Newark (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

The Earth Is Nearing an Environmental Tipping Point

WIREDDec-29-2025, 10:00:00 GMT

Today’s global coral bleaching events are the worst kind of climate warning.

coral reef, emission, environmental tipping point, (13 more...)

WIRED

Country:

Oceania > Australia (0.15)
North America > United States > California (0.15)
Europe > Germany > Brandenburg > Potsdam (0.05)
(8 more...)

Industry: Energy > Renewable > Geothermal (0.48)

Technology: Information Technology > Artificial Intelligence (0.70)

Add feedback

IMPRESS: Evaluating the Resilience of Imperceptible Perturbations Against Unauthorized Data Usage in Diffusion-Based Generative AI

Neural Information Processing SystemsDec-24-2025, 05:14:06 GMT

Diffusion-based image generation models, such as Stable Diffusion or DALL E 2, are able to learn from given images and generate high-quality samples following the guidance from prompts. For instance, they can be used to create artistic images that mimic the style of an artist based on his/her original artworks or to maliciously edit the original images for fake content. However, such ability also brings serious ethical issues without proper authorization from the owner of the original images. In response, several attempts have been made to protect the original images from such unauthorized data usage by adding imperceptible perturbations, which are designed to mislead the diffusion model and make it unable to properly generate new samples. In this work, we introduce a perturbation purification platform, named IMPRESS, to evaluate the effectiveness of imperceptible perturbations as a protective measure.IMPRESS is based on the key observation that imperceptible perturbations could lead to a perceptible inconsistency between the original image and the diffusion-reconstructed image, which can be used to devise a new optimization strategy for purifying the image, which may weaken the protection of the original image from unauthorized data usage (e.g., style mimicking, malicious editing).The proposed IMPRESS platform offers a comprehensive evaluation of several contemporary protection methods, and can be used as an evaluation platform for future protection methods.

imperceptible perturbation, original image, unauthorized data usage, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.76)

Add feedback

Filters

Collaborating Authors

resilience

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

50abc3e730e36b387ca8e02c26dc0a22-Paper.pdf

Towards Verified and Targeted Explanations through Formal Methods

Not all naked mole-rat queens go out in a blaze of bloody violence

ADD for Multi-Bit Image Watermarking

ProTransformer: Robustify Transformers via Plug-and-Play Paradigm

bf64451da212313c5ef1a00f49232c47-Paper-Conference.pdf

79358587d84628728199059f648824e6-Paper-Conference.pdf

wherex

The Earth Is Nearing an Environmental Tipping Point

IMPRESS: Evaluating the Resilience of Imperceptible Perturbations Against Unauthorized Data Usage in Diffusion-Based Generative AI