Adversarial Attacks on Black Box Video Classifiers: Leveraging the Power of Geometric Transformations (Supplementary Material)

Apr-24-2026, 17:52:58 GMT–Neural Information Processing Systems

We observe that our method outperforms the baseline methods in a statistically significant way. We consider four state-of-the-art video classification models that represent diverse approaches to learning from videos, namely C3D [1], SlowFast [2], TPN [3], and I3D [4], as the black-box victim models for our adversarial attacks. The C3D model applies 3D convolutions to learn spatio-temporal features from videos. SlowFast uses a two-pathway architecture in which the slow pathway operates at a low frame rate to capture spatial semantics and the fast pathway operates at a high frame rate to capture motion at fine temporal resolution. I3D introduces the Inflated 3D ConvNet (I3D), which inflates the 2D filters and pooling kernels of traditional 2D CNNs into 3D.
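The two-pathway sampling idea behind SlowFast can be illustrated with a minimal sketch. It is not taken from the supplementary material: the function name `sample_pathways`, the 64-frame clip, and the default values `slow_stride=16` and `alpha=8` are assumptions chosen for illustration only, and the victim models' actual sampling settings are not stated in this text.

```python
def sample_pathways(num_frames, slow_stride=16, alpha=8):
    """Return (slow_indices, fast_indices) for a clip of num_frames frames.

    Hypothetical helper: slow_stride is the temporal stride of the slow
    pathway, and alpha is how many times denser the fast pathway samples.
    Both defaults are illustrative assumptions.
    """
    if slow_stride % alpha != 0:
        raise ValueError("slow_stride must be divisible by alpha")
    fast_stride = slow_stride // alpha
    # Slow pathway: few frames, intended to capture spatial semantics.
    slow = list(range(0, num_frames, slow_stride))
    # Fast pathway: many frames, intended to capture fine-grained motion.
    fast = list(range(0, num_frames, fast_stride))
    return slow, fast


slow_idx, fast_idx = sample_pathways(64)
print(len(slow_idx), len(fast_idx))
```

Under these assumed settings the fast pathway sees `alpha` times as many frames as the slow pathway from the same clip, which is the low-frame-rate versus high-frame-rate contrast described above.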

artificial intelligence, geo-trap, machine learning

Country:
- North America > United States > California > Riverside County > Riverside (0.14)

Industry:
- Information Technology > Security & Privacy (0.86)
- Government > Military (0.72)
- Transportation > Air (0.62)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (0.67)
  - Vision > Video Understanding (0.34)
