AITopics | taxnodes:Technology: Instructional Materials

Forgetting, Ignorance or Myopia: Revisiting Key Challenges in Online Continual Learning

Neural Information Processing Systems, Mar-21-2025, 23:44:22 GMT

Online continual learning (OCL) requires models to learn from constant, endless streams of data. While significant efforts have been made in this field, most have focused on mitigating catastrophic forgetting to achieve better classification ability, at the cost of a much heavier training workload. They overlooked that in real-world scenarios, e.g., in high-speed data stream environments, data do not pause to accommodate slow models. In this paper, we emphasize that model throughput, defined as the maximum number of training samples that a model can process within a unit of time, is equally important. It directly limits how much data a model can utilize and presents a challenging dilemma for current methods. With this understanding, we revisit key challenges in OCL from both empirical and theoretical perspectives, highlighting two critical issues beyond the well-documented catastrophic forgetting: (i) Model's ignorance: the single-pass nature of OCL challenges models to learn effective features within constrained training time and storage capacity, leading to a trade-off between effective learning and model throughput; (ii) Model's myopia: the local learning nature of OCL on the current task leads the model to adopt overly simplified, task-specific features and an excessively sparse classifier, resulting in a gap between the optimal solution for the current task and the global objective. To tackle these issues, we propose the Non-sparse Classifier Evolution framework (NsCE) to facilitate effective global discriminative feature learning with minimal time cost. NsCE integrates non-sparse maximum separation regularization and targeted experience replay techniques with the help of pre-trained models, enabling rapid acquisition of new globally discriminative features. Extensive experiments demonstrate the substantial improvements of our framework in performance, throughput and real-world practicality.
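The throughput definition in this abstract can be made concrete with a minimal sketch. It assumes a hypothetical `train_step` callable and a stream that delivers samples at a fixed rate; neither name comes from the paper, and the "utilized fraction" is simply the ratio the definition implies, not a metric the authors report.

```python
import time


def measure_throughput(train_step, batch, n_steps=20):
    """Estimate samples processed per second by `train_step` (a hypothetical callable)."""
    start = time.perf_counter()
    for _ in range(n_steps):
        train_step(batch)
    # Guard against a zero reading on very coarse timers.
    elapsed = max(time.perf_counter() - start, 1e-12)
    return n_steps * len(batch) / elapsed


def utilized_fraction(stream_rate, throughput):
    """Fraction of an incoming stream a model can train on without falling behind.

    Both arguments are in samples per unit of time. If the stream is faster
    than the model, the surplus samples must be skipped.
    """
    if stream_rate <= 0:
        raise ValueError("stream_rate must be positive")
    return min(1.0, throughput / stream_rate)
```

Under these assumptions, a model that processes 250 samples per second on a stream arriving at 1000 samples per second can train on at most a quarter of the data, which is the sense in which throughput limits how much data a model can use.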

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: Asia > China (0.14)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Instructional Material > Online (0.70)

Industry:

Information Technology (1.00)
Education > Educational Setting > Online (0.68)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

3d4c0a618d0acd7921493e4f30395c22-Paper-Conference.pdf

Neural Information Processing Systems, Mar-21-2025, 15:29:27 GMT

detection, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

North America > United States (0.46)
North America > Canada > Quebec (0.14)

Genre:

Research Report (0.68)
Instructional Material (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(3 more...)

Learning via Surrogate PAC-Bayes, France SUEZ

Neural Information Processing Systems, Mar-21-2025, 11:42:20 GMT

PAC-Bayes learning is a comprehensive setting for (i) studying the generalisation ability of learning algorithms and (ii) deriving new learning algorithms by optimising a generalisation bound. However, optimising generalisation bounds might not always be viable, for reasons of tractability, computational cost, or both. For example, iteratively querying the empirical risk might prove computationally expensive. In response, we introduce a novel principled strategy for building an iterative learning algorithm via the optimisation of a sequence of surrogate training objectives, inherited from PAC-Bayes generalisation bounds. The key argument is to replace the empirical risk (seen as a function of hypotheses) in the generalisation bound by its projection onto a constructible low-dimensional functional space: these projections can be queried much more efficiently than the initial risk. On top of providing that generic recipe for learning via surrogate PAC-Bayes bounds, we (i) contribute theoretical results establishing that iteratively optimising our surrogates implies the optimisation of the original generalisation bounds, (ii) instantiate this strategy in the framework of meta-learning, introducing a meta-objective offering a closed-form expression for the meta-gradient, (iii) illustrate our approach with numerical experiments inspired by an industrial biochemical problem.
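For orientation, one commonly quoted PAC-Bayes bound (the McAllester bound in Maurer's refinement) is written out below. It is a textbook statement, not necessarily the bound used in the paper. The symbols are assumptions of this sketch: $\pi$ is a prior fixed before seeing the data, $\rho$ a posterior, $R$ the population risk, $\hat{R}_n$ the empirical risk on $n \ge 8$ i.i.d. samples, and the loss takes values in $[0,1]$.

```latex
% With probability at least 1 - \delta over the sample,
% simultaneously for all posteriors \rho:
\[
  \mathbb{E}_{h \sim \rho}\bigl[R(h)\bigr]
  \;\le\;
  \mathbb{E}_{h \sim \rho}\bigl[\hat{R}_n(h)\bigr]
  + \sqrt{\frac{\mathrm{KL}(\rho \,\|\, \pi) + \ln\frac{2\sqrt{n}}{\delta}}{2n}}
\]
```

Minimising the right-hand side over $\rho$ requires repeated evaluations of $\hat{R}_n$. The strategy described in the abstract replaces that term with a cheaper projection of the empirical risk and optimises the resulting surrogate objective instead.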

artificial intelligence, machine learning, pac-bayes, (20 more...)

Neural Information Processing Systems

Country:

Europe (0.64)
North America > United States > New York (0.14)

Genre:

Research Report > Experimental Study (1.00)
Instructional Material (0.66)

Industry:

Government (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Pseudo-Spherical Contrastive Divergence, Jiaming Song, Computer Science Department, Stanford University

Neural Information Processing Systems, Mar-21-2025, 11:05:16 GMT

Because of the intractable partition function, energy-based models (EBMs) are typically trained via contrastive divergence for maximum likelihood estimation. In this paper, we propose pseudo-spherical contrastive divergence (PS-CD) to generalize maximum likelihood learning of EBMs. PS-CD is derived from the maximization of a family of strictly proper homogeneous scoring rules, which avoids the computation of the intractable partition function and provides a generalized family of learning objectives that include contrastive divergence as a special case. Moreover, PS-CD allows us to flexibly choose various learning objectives to train EBMs without additional computational cost or variational minimax optimization. Theoretical analysis of the proposed method and extensive experiments on both synthetic data and commonly used image datasets demonstrate the effectiveness and modeling flexibility of PS-CD, as well as its robustness to data contamination, thus showing its superiority over maximum likelihood and f-EBMs.
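The two properties named in this abstract, "strictly proper" and "homogeneous", can be checked numerically on a toy discrete distribution. The sketch below uses the textbook pseudo-spherical scoring rule, S(x, q) = q(x)^γ / (Σ q^(γ+1))^(γ/(γ+1)). It is an illustration only, not the PS-CD training objective, and the function names are hypothetical.

```python
import numpy as np


def pseudo_spherical_score(q, x, gamma=1.0):
    """Pseudo-spherical score of outcome index `x` under a (possibly unnormalized) q."""
    q = np.asarray(q, dtype=float)
    # (gamma + 1)-norm of q; rescaling q rescales numerator and denominator equally.
    norm = np.sum(q ** (gamma + 1.0)) ** (1.0 / (gamma + 1.0))
    return q[x] ** gamma / norm ** gamma


def expected_score(p, q, gamma=1.0):
    """Expected score of model q when outcomes are drawn from the true distribution p."""
    p = np.asarray(p, dtype=float)
    return sum(p[x] * pseudo_spherical_score(q, x, gamma) for x in range(len(p)))


p = [0.5, 0.3, 0.2]          # true distribution
q = [0.2, 0.3, 0.5]          # a mismatched model
q_unnormalized = [2.0, 3.0, 5.0]  # same model, scaled by 10

# Strictly proper: the true distribution scores higher in expectation.
print(expected_score(p, p) > expected_score(p, q))
# Homogeneous: scaling q leaves the score unchanged, so no normalizer is needed.
print(np.isclose(pseudo_spherical_score(q, 0), pseudo_spherical_score(q_unnormalized, 0)))
```

The second check is the relevant one for EBMs: because the score is unchanged when q is multiplied by a constant, it can be evaluated on an unnormalized density without knowing the partition function.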

artificial intelligence, arxiv preprint arxiv, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe (0.28)
Asia > Middle East (0.28)

Genre:

Instructional Material (0.68)
Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Neural Information Processing Systems, Mar-21-2025, 07:58:23 GMT

Autonomous agents that accomplish complex computer tasks with minimal human intervention can significantly enhance the accessibility and productivity of human-computer interaction. Existing benchmarks either lack interactive environments or are limited to specific applications or domains, failing to reflect the diversity and complexity of real-world computer use and limiting agent scalability.

large language model, machine learning, natural language, (22 more...)

Neural Information Processing Systems

Country: Asia (0.28)

Genre:

Workflow (0.95)
Instructional Material > Course Syllabus & Notes (0.46)

Industry:

Law (1.00)
Education > Educational Setting > Online (0.93)
Information Technology > Software (0.69)
Education > Educational Technology (0.67)

Technology:

Information Technology > Software (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Communications > Mobile (1.00)
(8 more...)

b5b1d9ada94bb80609d21eecf7a2ce7a-Paper.pdf

Neural Information Processing Systems, Mar-21-2025, 07:36:04 GMT

algorithm, artificial intelligence, machine learning, (17 more...)

Neural Information Processing Systems

Country: Asia > China > Hong Kong (0.15)

Genre: Instructional Material (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.94)

b5488aeff42889188d03c9895255cecc-Paper.pdf

Neural Information Processing Systems, Mar-21-2025, 06:38:29 GMT

learner, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Country: Europe (0.28)

Genre: Instructional Material > Course Syllabus & Notes (0.46)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

3a48b0eaba26ba862220a307a9edb0bb-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing Systems, Mar-21-2025, 06:37:08 GMT

artificial intelligence, machine learning, video, (20 more...)

Neural Information Processing Systems

Country: Asia > China (0.14)

Genre:

Instructional Material (0.69)
Research Report > New Finding (0.46)

Industry:

Education (1.00)
Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.67)
Information Technology > Security & Privacy (0.67)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Security & Privacy (0.67)
Information Technology > Artificial Intelligence > Natural Language (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Generative vs Discriminative: Rethinking The Meta-Continual Learning

Neural Information Processing Systems, Mar-21-2025, 04:24:41 GMT

Deep neural networks have achieved human-level capabilities in various learning tasks. However, they generally lose performance in more realistic scenarios like learning in a continual manner. In contrast, humans can incorporate their prior knowledge to learn new concepts efficiently without forgetting older ones. In this work, we leverage meta-learning to encourage the model to learn how to learn continually. Inspired by human concept learning, we develop a generative classifier that efficiently uses data-driven experience to learn new concepts even from few samples while being immune to forgetting. Along with cognitive and theoretical insights, extensive experiments on standard benchmarks demonstrate the effectiveness of the proposed method. The ability to remember all previous concepts, with negligible computational and structural overheads, suggests that generative models provide a natural way for alleviating catastrophic forgetting, which is a major drawback of discriminative models. The code is publicly available at https://github.com/aminbana/GeMCL.
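Why a generative classifier can avoid forgetting is easy to see in a minimal sketch. The example below is a nearest-class-mean classifier, which corresponds to a Gaussian generative model with a shared isotropic covariance and equal class priors. It is a stand-in chosen for brevity, not the model from the paper or the GeMCL repository, and the class name is hypothetical.

```python
import numpy as np


class ClassMeanClassifier:
    """Nearest-class-mean classifier that keeps one running mean per class."""

    def __init__(self):
        self.sums = {}    # label -> sum of feature vectors seen for that class
        self.counts = {}  # label -> number of samples seen for that class

    def update(self, features, label):
        """Fold one sample into its own class statistics; other classes are untouched."""
        features = np.asarray(features, dtype=float)
        if label not in self.sums:
            self.sums[label] = np.zeros_like(features)
            self.counts[label] = 0
        self.sums[label] += features
        self.counts[label] += 1

    def mean(self, label):
        return self.sums[label] / self.counts[label]

    def predict(self, features):
        """Return the label whose class mean is closest in Euclidean distance."""
        features = np.asarray(features, dtype=float)
        return min(self.sums, key=lambda c: np.linalg.norm(features - self.mean(c)))


clf = ClassMeanClassifier()
clf.update([0.0, 0.0], "a")
clf.update([0.0, 2.0], "a")
mean_a_before = clf.mean("a").copy()
clf.update([10.0, 10.0], "b")  # learning a new class later in the stream
```

Learning class "b" only writes to the statistics of "b", so the parameters of "a" are exactly what they were before. A discriminative classifier trained by gradient descent shares weights across classes and offers no such guarantee.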

artificial intelligence, bayesian inference, machine learning, (16 more...)

Neural Information Processing Systems

Country: North America (0.28)

Genre: Instructional Material (0.48)

Industry:

Education (0.47)
Health & Medicine (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Dealing with Synthetic Data Contamination in Online Continual Learning, Maorong Wang, Nicolas Michel, Jiafeng Mao

Neural Information Processing Systems, Mar-21-2025, 00:59:42 GMT

Image generation has shown remarkable results in generating high-fidelity realistic images, in particular with the advancement of diffusion-based models. However, the prevalence of AI-generated images may have side effects for the machine learning community that are not clearly identified. Meanwhile, the success of deep learning in computer vision is driven by massive datasets collected on the Internet. The extensive quantity of synthetic data being added to the Internet could become an obstacle for future researchers trying to collect "clean" datasets without AI-generated content. Prior research has shown that training on datasets contaminated by synthetic images may result in performance degradation. In this paper, we investigate the potential impact of contaminated datasets on Online Continual Learning (CL) research. We experimentally show that contaminated datasets might hinder the training of existing online CL methods. Also, we propose Entropy Selection with Real-synthetic similarity Maximization (ESRM), a method to alleviate the performance deterioration caused by synthetic images when training online CL models. Experiments show that our method can significantly alleviate performance deterioration, especially when the contamination is severe.

artificial intelligence, deep learning, machine learning, (18 more...)

Neural Information Processing Systems

Genre: