Goto

Collaborating Authors

 activation map


FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors

Neural Information Processing Systems

LiDAR-based 3D object detection has made impressive progress recently, yet most existing models are black-box, lacking interpretability. Previous explanation approaches primarily focus on analyzing image-based models and are not readily applicable to LiDAR-based 3D detectors. In this paper, we propose a feature factorization activation map (FFAM) to generate high-quality visual explanations for 3D detectors. FFAM employs non-negative matrix factorization to generate concept activation maps and subsequently aggregates these maps to obtain a global visual explanation. To achieve object-specific visual explanations, we refine the global visual explanation using the feature gradient of a target object. Additionally, we introduce a voxel upsampling strategy to align the scale between the activation map and input point cloud. We qualitatively and quantitatively analyze FFAM with multiple detectors on several datasets. Experimental results validate the high-quality visual explanations produced by FFAM.


ebd9629fc3ae5e9f6611e2ee05a31cef-Supplemental.pdf

Neural Information Processing Systems

Dataset (1)consists ofvarious lines in the image at a discrete set of angles, and the classification task is to detect the angle of 14 the line. Some images from the test set ofclasses 80 and 100 are multiplied with apermutation matrix to randomly permute rows and columns.


Activation Map Compression through Tensor Decomposition for Deep Learning

Neural Information Processing Systems

The application of low-order decomposition results in considerable memory savings while preserving the features essential for learning, and also offers theoretical guarantees to convergence.


Appendix for " CS-Isolate: Extracting Hard Confident Examples by Content and Style Isolation " Y exiong Lin 1 Y u Y ao

Neural Information Processing Systems

We denote observed variables with gray color and latent variables with white color. Firstly, we introduce the concept of an uncontrolled style factor . Why do confident examples encourage content-style isolation? Calculate the loss using Eq. 1 and update networks; Output: The inference networks and classifier heads q It's essential to understand that although data augmentation cannot control all style factors, it still offers the benefit of "partial isolation". This approach, therefore, ensures that styles changes don't affect the derived content representation Calculate the loss using Eq. 2 and update networks; Output: The inference networks and classifier heads q Finally, confident and unlabeled examples are used to train the models based on the MixMatch algorithm.





Predicts HumanVisualSelectivity

Neural Information Processing Systems

The 1For our experiments we are counting the number of AMTHuman Intelligence Tasks (HITs) that were completed. Wedid not exclude AMT workers from completing multiple HITs. The authors posit that this noisiness is because the gradient may fluctuate sharply at small scales, which seems plausible especially given that, duetoReLUactivationfunctions, theoutput generally isnotevencontinuously differentiable. ThisCAM indicates the discriminative regions of the image used by the CNN to identify that class. We used each of the above passive attention methods to acquire attention maps from each of the modelsinthetoppartofTable2.


TREC: TransientRedundancy Elimination-based Convolution

Neural Information Processing Systems

Convolutional Neural Networks (CNNs) are computation intensive, making their deployment on resource-constrained devices (e.g., Microcontrollers equipped with 2MB memory) challenging.