regularize
Learning Structured Sparsity in Deep Neural Networks
High demand for computation resources severely hinders the deployment of large-scale Deep Neural Networks (DNNs) in resource-constrained devices. In this work, we propose a Structured Sparsity Learning (SSL) method to regularize the structures (i.e., filters, channels, filter shapes, and layer depth) of DNNs. SSL can: (1) learn a compact structure from a bigger DNN to reduce computation cost; (2) obtain a hardware-friendly structured sparsity of a DNN to efficiently accelerate its evaluation. Experimental results show that SSL achieves on average 5.1X and 3.1X speedups of convolutional layer computation of AlexNet on CPU and GPU, respectively, with off-the-shelf libraries; these speedups are about twice those of non-structured sparsity; (3) regularize the DNN structure to improve classification accuracy. The results show that for CIFAR-10, regularization on layer depth reduces a 20-layer Deep Residual Network (ResNet) to 18 layers while improving the accuracy from 91.25% to 92.60%, which is still higher than that of the original 32-layer ResNet.
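SSL realizes this with group Lasso penalties, one group per structure (a whole filter, a whole channel, etc.), so that entire structures are driven to zero together. A minimal PyTorch sketch of the filter-wise and channel-wise penalties; the helper names and the `lam` weighting are illustrative, not the paper's API:

```python
# Group Lasso over structured groups of a conv layer, in the spirit of SSL.
import torch
import torch.nn as nn

def group_lasso_filters(conv: nn.Conv2d) -> torch.Tensor:
    # conv.weight: (out_channels, in_channels, kH, kW).
    # One group per output filter -> sum of per-filter L2 norms.
    return conv.weight.flatten(1).norm(dim=1).sum()

def group_lasso_channels(conv: nn.Conv2d) -> torch.Tensor:
    # One group per input channel -> sum of per-channel L2 norms.
    return conv.weight.transpose(0, 1).flatten(1).norm(dim=1).sum()

# Usage sketch:
# total_loss = task_loss + lam * sum(
#     group_lasso_filters(m) + group_lasso_channels(m)
#     for m in model.modules() if isinstance(m, nn.Conv2d))
```

Because the L2 norm of a group is non-smooth at zero, the penalty zeroes whole filters or channels rather than scattered weights, which is what makes the resulting sparsity hardware-friendly.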
R-Drop: Regularized Dropout for Neural Networks
In this paper, we introduce a simple yet more effective alternative to regularize the training inconsistency induced by dropout, named R-Drop. Concretely, in each mini-batch training, each data sample goes through the forward pass twice, and each pass is processed by a different sub-model obtained by randomly dropping out some hidden units.
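A minimal sketch of this objective, assuming a PyTorch classifier kept in train mode so dropout is active, with a symmetric KL consistency term; the function name and the `alpha` weight are ours, not the paper's exact API:

```python
import torch
import torch.nn.functional as F

def r_drop_loss(model, x, y, alpha: float = 1.0) -> torch.Tensor:
    # Two stochastic forward passes -> two dropout sub-models.
    logits1 = model(x)
    logits2 = model(x)
    # Cross-entropy on both predictions.
    ce = F.cross_entropy(logits1, y) + F.cross_entropy(logits2, y)
    # Symmetric KL pulls the two predictive distributions together.
    p1 = F.log_softmax(logits1, dim=-1)
    p2 = F.log_softmax(logits2, dim=-1)
    kl = F.kl_div(p1, p2, log_target=True, reduction="batchmean") \
       + F.kl_div(p2, p1, log_target=True, reduction="batchmean")
    return ce + alpha * kl / 2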
One important limitation of VAEs is the strong prior assumption that latent representations learned by the model follow a simple uni-modal Gaussian distribution. Further, the variational training procedure poses considerable practical challenges. Recently proposed regularized autoencoders offer a deterministic autoencoding framework that simplifies the original VAE objective and is significantly easier to train. Since these models provide only weak control over the learned latent distribution, they require an ex-post density estimation step to generate samples comparable to those of VAEs. In this paper, we propose a simple and end-to-end trainable deterministic autoencoding framework that efficiently shapes the latent space of the model during training and utilizes the capacity of expressive multi-modal latent distributions. The proposed training procedure provides direct evidence of whether the latent distribution adequately captures complex aspects of the encoded data.
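For concreteness, the "ex-post density estimation" step mentioned above commonly means fitting a simple density model (a GMM is a typical choice) on the latent codes of a trained deterministic autoencoder and sampling from it; a minimal sketch, with `encode`/`decode` as placeholder callables returning/consuming NumPy arrays:

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def ex_post_sampling(encode, decode, train_x, n_components=10, n_samples=16):
    z = encode(train_x)                       # (N, d) latents of training set
    gmm = GaussianMixture(n_components).fit(z)
    z_new, _ = gmm.sample(n_samples)          # draw latents from fitted GMM
    return decode(z_new.astype(np.float32))   # decode back to data space
```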
MoTE: Reconciling Generalization with Specialization for Visual-Language to Video Knowledge Transfer
Transferring visual-language knowledge from large-scale foundation models for video recognition has proved effective. To bridge the domain gap, additional parametric modules are added to capture temporal information. However, zero-shot generalization diminishes as the number of specialized parameters grows, forcing existing works to trade off zero-shot against close-set performance. In this paper, we present MoTE, a novel framework that balances generalization and specialization in one unified model. Our approach tunes a mixture of temporal experts to learn multiple task views with varying degrees of data fitting. To maximally preserve the knowledge of each expert, we propose Weight Merging Regularization, which regularizes the merging process of experts in weight space.
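The abstract does not spell out the penalty, but one plausible reading of regularizing the merge in weight space is to keep every expert close to the uniform average of all experts, so the merged model loses little of any expert's knowledge. A hedged sketch under that assumption; the paper's exact formulation may differ:

```python
import torch

def weight_merging_reg(expert_weights: list[torch.Tensor]) -> torch.Tensor:
    # Merge experts by uniform averaging in weight space.
    merged = torch.stack(expert_weights).mean(dim=0)
    # Penalize each expert's squared distance from the merge.
    return sum(((w - merged) ** 2).sum() for w in expert_weights)
```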
A Unified Analysis of Mixed Sample Data Augmentation: A Loss Function Perspective
We propose the first unified theoretical analysis of mixed sample data augmentation (MSDA), such as Mixup and CutMix. Our theoretical results show that, regardless of the choice of mixing strategy, MSDA behaves as a pixel-level regularization of the underlying training loss and a regularization of the first-layer parameters. Our results likewise support that the MSDA training strategy can improve adversarial robustness and generalization over vanilla training. Using the theoretical results, we provide a high-level understanding of how different design choices of MSDA work differently. For example, we show that the most popular MSDA methods, Mixup and CutMix, behave differently: CutMix regularizes the input gradients by pixel distances, while Mixup regularizes the input gradients regardless of pixel distances. Our theoretical results also show that the optimal MSDA strategy depends on tasks, datasets, or model parameters. From these observations, we propose generalized MSDAs: a Hybrid version of Mixup and CutMix (HMix) and Gaussian Mixup (GMix), simple extensions of Mixup and CutMix. Our implementation leverages the advantages of both Mixup and CutMix while remaining efficient, with a computation cost almost negligible compared to Mixup and CutMix themselves. Our empirical study shows that HMix and GMix outperform previous state-of-the-art MSDA methods on CIFAR-100 and ImageNet classification tasks.
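For reference, the two base MSDA schemes being analyzed mix samples as follows (standard formulations with the mixing ratio drawn from a Beta distribution; HMix and GMix are not reproduced since the abstract does not give their exact forms):

```python
import numpy as np

def mixup(x1, x2, y1, y2, a=1.0):
    # Convex combination of both images and both (one-hot) labels.
    lam = np.random.beta(a, a)
    return lam * x1 + (1 - lam) * x2, lam * y1 + (1 - lam) * y2

def cutmix(x1, x2, y1, y2, a=1.0):
    # Paste a random rectangle of x2 onto x1; mix labels by kept area.
    lam = np.random.beta(a, a)
    h, w = x1.shape[-2:]
    rh, rw = int(h * np.sqrt(1 - lam)), int(w * np.sqrt(1 - lam))
    cy, cx = np.random.randint(h), np.random.randint(w)
    r0, r1 = max(cy - rh // 2, 0), min(cy + rh // 2, h)
    c0, c1 = max(cx - rw // 2, 0), min(cx + rw // 2, w)
    out = x1.copy()
    out[..., r0:r1, c0:c1] = x2[..., r0:r1, c0:c1]
    lam_adj = 1 - (r1 - r0) * (c1 - c0) / (h * w)  # fraction of x1 kept
    return out, lam_adj * y1 + (1 - lam_adj) * y2
```

The contrast the analysis draws is visible here: Mixup perturbs every pixel uniformly, while CutMix's perturbation is localized to a rectangle, which is why its regularization of input gradients depends on pixel distances.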
On Linear Stability of SGD and Input-Smoothness of Neural Networks
The multiplicative structure of parameters and input data in the first layer of neural networks is explored to build a connection between the landscape of the loss function with respect to parameters and the landscape of the model function with respect to input data. Through this connection, it is shown that flat minima regularize the gradient of the model function, which explains the good generalization performance of flat minima. We then go beyond flatness and consider high-order moments of the gradient noise, showing via a linear stability analysis of Stochastic Gradient Descent (SGD) around global minima that SGD tends to impose constraints on these moments. Together with the multiplicative structure, we identify the Sobolev regularization effect of SGD, i.e., SGD implicitly regularizes Sobolev seminorms of the model function with respect to the input data.
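The input-smoothness quantity at play can be made concrete as the squared norm of the model's gradient with respect to its input, a first-order Sobolev seminorm estimated on a batch. A small PyTorch sketch, for illustration only (for a multi-output model this uses the gradient of the summed outputs as a common proxy):

```python
import torch

def input_gradient_norm(model, x: torch.Tensor) -> torch.Tensor:
    # Differentiate the model output w.r.t. the *input*, not the parameters.
    x = x.clone().requires_grad_(True)
    out = model(x).sum()                   # scalar so autograd returns d out / d x
    (grad,) = torch.autograd.grad(out, x)
    # Mean squared per-example input-gradient norm over the batch.
    return grad.flatten(1).norm(dim=1).pow(2).mean()
```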
Leave No Stone Unturned: Mine Extra Knowledge for Imbalanced Facial Expression Recognition
Facial expression data is characterized by a significant imbalance, with most collected data showing happy or neutral expressions and fewer instances of fear or disgust. This imbalance poses challenges to facial expression recognition (FER) models, hindering their ability to fully understand various human emotional states. Existing FER methods typically report overall accuracy on highly imbalanced test sets but exhibit low performance in terms of the mean accuracy across all expression classes. In this paper, our aim is to address the imbalanced FER problem. Existing methods primarily focus on learning knowledge of minor classes solely from minor-class samples.
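The evaluation gap described above is simply the difference between overall accuracy and mean per-class (balanced) accuracy, which a majority class cannot dominate; a small sketch of the latter:

```python
import numpy as np

def mean_class_accuracy(y_true: np.ndarray, y_pred: np.ndarray) -> float:
    # Average the per-class accuracies so every class counts equally,
    # regardless of how many samples it has.
    classes = np.unique(y_true)
    per_class = [(y_pred[y_true == c] == c).mean() for c in classes]
    return float(np.mean(per_class))
```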