AITopics | gdw

ADerivation of D1 Denote the logit vector as x, we have pj = exj

Neural Information Processing SystemsApr-26-2026, 13:17:00 GMT

Without zero-mean constraint, the training becomes unstable. Following the training setting of [23], the classifier network is trained with SGD with a weight decay 5e-4, an initial learning rate of 1e-1 and a mini-batch size of 100 for all methods. We use the cosine learning rate decay schedule [49] for a total of 80 epochs. We set the outer level learning ηω as 14 Figure 7: Training curve without zero-mean constraint on CIFAR10 under 40% uniform noise. The MLP weighting network is trained with Adam [51] with a fixed learning rate 1e-3 and a weight decay 1e-4.

artificial intelligence, experiment, machine learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

649adc59afdef2a8b9e943f94a04b02f-Paper.pdf

Neural Information Processing SystemsFeb-19-2026, 03:17:24 GMT

But these methods are unable to improve throughput (frames-per-second) on real-life hardware while simultaneously preserving robustness toadversarial perturbations.

artificial intelligence, convolution, machine learning, (19 more...)

Neural Information Processing Systems

Country: North America > United States > Illinois (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

75ebb02f92fc30a8040bbd625af999f1-Supplemental.pdf

Neural Information Processing SystemsFeb-9-2026, 09:54:44 GMT

dataset, epoch, experiment, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Generalized DataWeighting via Class-Level Gradient Manipulation

Neural Information Processing SystemsDec-24-2025, 07:49:15 GMT

Label noise and class imbalance are two major issues coexisting in real-world datasets. To alleviate the two issues, state-of-the-art methods reweight each instance by leveraging a small amount of clean and unbiased data. Yet, these methods overlook class-level information within each instance, which can be further utilized to improve performance. To this end, in this paper, we propose Generalized Data Weighting (GDW) to simultaneously mitigate label noise and class imbalance by manipulating gradients at the class level. To be specific, GDW unrolls the loss gradient to class-level gradients by the chain rule and reweights the flow of each gradient separately.

class-level gradient manipulation, generalized dataweighting, name change, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.80)

Add feedback

Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks

Neural Information Processing SystemsDec-24-2025, 05:18:51 GMT

Despite their tremendous successes, convolutional neural networks (CNNs) incur high computational/storage costs and are vulnerable to adversarial perturbations. Recent works on robust model compression address these challenges by combining model compression techniques with adversarial training. But these methods are unable to improve throughput (frames-per-second) on real-life hardware while simultaneously preserving robustness to adversarial perturbations. To overcome this problem, we propose the method of Generalized Depthwise-Separable (GDWS) convolution - an efficient, universal, post-training approximation of a standard 2D convolution. GDWS dramatically improves the throughput of a standard pre-trained network on real-life hardware while preserving its robustness. Lastly, GDWS is scalable to large problem sizes since it operates on pre-trained models and doesn't require any additional training. We establish the optimality of GDWS as a 2D convolution approximator and present exact algorithms for constructing optimal GDWS convolutions under complexity and error constraints. We demonstrate the effectiveness of GDWS via extensive experiments on CIFAR-10, SVHN, and ImageNet datasets. Our code can be found at https://github.com/hsndbk4/GDWS.

adversarially robust, generalized depthwise-separable convolution, robust and efficient neural network, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks

Neural Information Processing SystemsAug-14-2025, 20:59:10 GMT

But these methods are unable to improve throughput (frames-per-second) on real-life hardware while simultaneously preserving robustness to adversarial perturbations.

convolution, gdw, neural network, (16 more...)

Neural Information Processing Systems

Country: North America > United States > Illinois > Champaign County > Urbana (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

Generalized DataWeighting via Class-Level Gradient Manipulation

Neural Information Processing SystemsOct-11-2024, 05:55:37 GMT

Label noise and class imbalance are two major issues coexisting in real-world datasets. To alleviate the two issues, state-of-the-art methods reweight each instance by leveraging a small amount of clean and unbiased data. Yet, these methods overlook class-level information within each instance, which can be further utilized to improve performance. To this end, in this paper, we propose Generalized Data Weighting (GDW) to simultaneously mitigate label noise and class imbalance by manipulating gradients at the class level. To be specific, GDW unrolls the loss gradient to class-level gradients by the chain rule and reweights the flow of each gradient separately.

class-level gradient manipulation, generalized dataweighting, noise and class imbalance, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.85)

Add feedback

Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks

Neural Information Processing SystemsOct-10-2024, 19:29:55 GMT

Despite their tremendous successes, convolutional neural networks (CNNs) incur high computational/storage costs and are vulnerable to adversarial perturbations. Recent works on robust model compression address these challenges by combining model compression techniques with adversarial training. But these methods are unable to improve throughput (frames-per-second) on real-life hardware while simultaneously preserving robustness to adversarial perturbations. To overcome this problem, we propose the method of Generalized Depthwise-Separable (GDWS) convolution - an efficient, universal, post-training approximation of a standard 2D convolution. GDWS dramatically improves the throughput of a standard pre-trained network on real-life hardware while preserving its robustness.

adversarially robust, generalized depthwise-separable convolution, robust and efficient neural network, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Filters

Collaborating Authors

gdw

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

ADerivation of D1 Denote the logit vector as x, we have pj = exj

649adc59afdef2a8b9e943f94a04b02f-Paper.pdf

75ebb02f92fc30a8040bbd625af999f1-Supplemental.pdf

Generalized DataWeighting via Class-Level Gradient Manipulation

Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks

Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks

Generalized DataWeighting via Class-Level Gradient Manipulation

Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks