Appendix A Patch based Negative Data Augmentation Reduces Texture Bias

Aug-15-2025, 12:23:12 GMT–Neural Information Processing Systems

Figure 5: ViTs trained only on our patch-based transformations exhibit stronger texture bias. Each bar is the texture accuracy ( %) on Conflict Stimuli (Geirhos et al., 2018), and a higher texture accuracy indicates the model has a higher bias towards texture. The "texture accuracy" is defined as the percentage of images that are classified as the "texture" label, provided the image is classified as either "texture" or "shape" label. The baseline model is ViT -B/16 in (Dosovitskiy et al., 2021) trained on original images. Other models are trained on patch-based transformed images, e.g., "P-Shuffle" stands for a ViT -B/16 model trained on patch-based shuffled images.

artificial intelligence, machine learning, vit-b 16, (14 more...)

Neural Information Processing Systems

Aug-15-2025, 12:23:12 GMT

Conferences PDF

Add feedback

Industry:
- Energy > Oil & Gas
  - Midstream (0.31)
- Materials > Chemicals
  - Commodity Chemicals > Petrochemicals
    - LNG (0.31)
  - Industrial Gases > Liquified Gas (0.31)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.33)

Duplicate Docs Excel Report

Title
67662aa16456e0df65ab001136f92fd0-Supplemental-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found