AITopics | cutmix

Collaborating Authors

cutmix

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Inducing Spatial Locality in Vision Transformers through the Training Protocol

Toledo, Eduardo Santiago, Martínez, Asael Fabian

arXiv.org Machine LearningMay-19-2026

We investigate whether the training protocol can induce spatial locality in the early layers of a Vision Transformer (ViT) trained from scratch, without large-scale pretraining. Keeping the architecture and optimization procedure fixed, we compare a Baseline protocol with a Modern protocol (AutoAugment/ColorJitter, CutMix, and Label Smoothing) on CIFAR-10, CIFAR-100, and Tiny-ImageNet, characterizing each attention head via Mean Attention Distance (MAD) and normalized entropy. Across all three datasets, the Modern protocol produces more local and more concentrated attention in early layers; on CIFAR-100, the minimum MAD drops from 0.316 (Baseline) to 0.008 (Modern). To identify the source of this effect, we conduct an ablation study on CIFAR-100 by adding or removing each component individually. The results identify CutMix as the determining component within our experiments: all conditions with CutMix exhibit MAD 0.024, while all conditions without CutMix remain at MAD 0.210. AutoAugment and Label Smoothing show no independent effect on locality. Taken together, these findings suggest that the pressure to classify from partial image regions, induced by CutMix, can promote the emergence of local attention in Vision Transformers.

artificial intelligence, machine learning, protocol, (16 more...)

arXiv.org Machine Learning

2605.1639

Country: South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.76)

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

fb4c48608ce8825b558ccf07169a3421-Supplemental.pdf

Neural Information Processing SystemsApr-27-2026, 22:53:20 GMT

In this section, we perform additional diagnostics that give us confidence that our models are not doing any form of gradient obfuscation or masking [3, 53]. First, we report in Table 4 the robust accuracy obtained by our strongest models against a diverse set of attacks. The cascade is composed as follows: AUTOPGD-CE, an untargeted attack using PGD with an adaptive step on the cross-entropy loss [10], AUTOPGD-T, a targeted attack using PGD with an adaptive step on the difference of logits ratio [10], FAB-T, a targeted attack which minimizes the norm of adversarial perturbations [9], SQUARE, a query-efficient black-box attack [1]. First, we observe that our combination of attacks, denoted AA+MT matches the final robust accuracy measured by AUTOATTACK. Second, we also notice that the black-box attack (i.e., SQUARE) does not find any additional adversarial examples.

accuracy, artificial intelligence, robust accuracy, (17 more...)

Neural Information Processing Systems

Industry: Transportation > Air (0.55)

Technology: Information Technology > Artificial Intelligence (0.70)

Add feedback

Data Augmentation Can Improve Robustness

Neural Information Processing SystemsApr-27-2026, 22:53:17 GMT

Adversarial training suffers from robust overfitting, a phenomenon where the robust test accuracy starts to decrease during training. In this paper, we focus on reducing robust overfitting by using common data augmentation schemes. We demonstrate that, contrary to previous findings, when combined with model weight averaging, data augmentation can significantly boost robust accuracy. Furthermore, we compare various data augmentations techniques and observe that spatial composition techniques work best for adversarial training. Finally, we evaluate our approach on CIFAR-10 against ` and `2 norm-bounded perturbations of size = 8/255 and = 128/255, respectively. We show large absolute improvements of +2.93% and +2.16% in robust accuracy compared to previous state-of-the-art methods. In particular, against ` norm-bounded perturbations of size = 8/255, our model reaches 60.07%

artificial intelligence, deep learning, machine learning, (17 more...)

Neural Information Processing Systems

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

d01db5cd2555ba11f75da0454d57b903-Paper-Conference.pdf

Neural Information Processing SystemsFeb-18-2026, 05:41:54 GMT

artificial intelligence, cutmix, machine learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report > New Finding (0.92)
Research Report > Experimental Study (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Vision (0.67)

Add feedback

c917d8b9e01427f3184d80ade22f4d1f-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 01:56:33 GMT

artificial intelligence, ipmix, machine learning, (16 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

Add feedback

Appendix A Further Empirical Studies

Neural Information Processing SystemsFeb-15-2026, 12:18:31 GMT

As reported in Table A3, PS-MT consistently shows lower distances than Dual Teacher shows. The STD is similarly between 2 and over 50 times smaller. PS-MT's teachers (albeit they may have distinct characteristics) potentially becomes similar distances to the student at each epoch. Comparative analysis of performance based on different CutMix variations. We further report additional quantitative results encompassing three different splits: original high-quality set, blended set, and blended high-quality set .

artificial intelligence, machine learning, semantic segmentation, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

AUnifiedAnalysisofMixedSampleData Augmentation: ALossFunctionPerspective

Neural Information Processing SystemsFeb-12-2026, 13:07:27 GMT

Using the theoretical results, we provide a high-level understanding of howdifferentdesign choices ofMSDAworkdifferently.

artificial intelligence, machine learning, msda, (18 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

AUnifiedAnalysisofMixedSampleData Augmentation: ALossFunctionPerspective

Neural Information Processing SystemsFeb-12-2026, 13:07:23 GMT

Using the theoretical results, we provide a high-level understanding of howdifferentdesign choices ofMSDAworkdifferently.

artificial intelligence, arxivpreprintarxiv, machine learning, (17 more...)

Neural Information Processing Systems

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

fb4c48608ce8825b558ccf07169a3421-Supplemental.pdf

Neural Information Processing SystemsFeb-12-2026, 00:37:24 GMT

accuracy, augmentation, robust accuracy, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.69)

Add feedback

OntheEffectivenessofLipschitz-DrivenRehearsal inContinualLearning-SupplementaryMaterial

Neural Information Processing SystemsFeb-12-2026, 00:08:31 GMT

If α > β, we are overemphasizing the contribution of the first term of Eq. 9 (which brings each layer'sλk1 andck close toeach other) overthesecond one(which induces small Lipschitz targets).

artificial intelligence, lider, machine learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.96)

Add feedback