Elastic Weight Consolidation for Knowledge Graph Continual Learning: An Empirical Evaluation

Jhajj, Gaganpreet, Lin, Fuhua

arXiv.org Artificial Intelligence

Knowledge graphs (KGs) require continual updates as new information emerges, but neural embedding models suffer from catastrophic forgetting when learning new tasks sequentially. We evaluate Elastic Weight Consolidation (EWC), a regularization-based continual learning method, on KG link prediction using TransE embeddings on FB15k-237. Across multiple experiments with five random seeds, we find that EWC reduces catastrophic forgetting from 12.62% to 6.85%, a 45.7% reduction compared to naive sequential training. We observe that the task partitioning strategy affects the magnitude of forgetting: relation-based partitioning (grouping triples by relation type) exhibits 9.8 percentage points higher forgetting than randomly partitioned tasks (12.62% vs 2.81%), suggesting that task construction influences evaluation outcomes. While focused on a single embedding model and dataset, our results demonstrate that EWC effectively mitigates catastrophic forgetting in KG continual learning and highlight the importance of evaluation protocol design.
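The EWC regularizer the abstract refers to penalizes movement of parameters that were important for earlier tasks, weighting each squared deviation by a Fisher information estimate. A minimal sketch of that penalty is below; the function name, the `lam` value, and the toy arrays are illustrative choices, not details from the paper:

```python
import numpy as np

def ewc_penalty(theta, theta_star, fisher, lam=100.0):
    """EWC quadratic penalty: (lam / 2) * sum_i F_i * (theta_i - theta_star_i)^2.

    theta      -- current parameters (after/while training on the new task)
    theta_star -- parameters learned on the previous task
    fisher     -- diagonal Fisher information estimate from the previous task
    lam        -- regularization strength (hyperparameter)
    """
    return 0.5 * lam * np.sum(fisher * (theta - theta_star) ** 2)

# Illustrative usage: parameters with high Fisher values are penalized
# more heavily for drifting away from their previous-task values.
theta_star = np.array([1.0, -0.5, 2.0])
fisher = np.array([5.0, 0.1, 0.1])   # first parameter matters most
theta = theta_star + 0.3             # uniform drift on all parameters
print(ewc_penalty(theta, theta_star, fisher, lam=1.0))
```

In sequential training, this penalty is added to the new task's link-prediction loss, so parameters the Fisher estimate marks as important stay close to their old values while unimportant ones remain free to adapt.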






KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification

Sakai, Hajar, Lam, Sarah S.

arXiv.org Artificial Intelligence

The increasing volume of healthcare textual data requires computationally efficient, yet highly accurate classification approaches able to handle the nuanced and complex nature of medical terminology. This research presents Knowledge Distillation for Healthcare Multi-Label Text Classification (KDH-MLTC), a framework leveraging model compression and Large Language Models (LLMs). The proposed approach addresses conventional healthcare Multi-Label Text Classification (MLTC) challenges by integrating knowledge distillation and sequential fine-tuning, subsequently optimized through Particle Swarm Optimization (PSO) for hyperparameter tuning. KDH-MLTC transfers knowledge from a more complex teacher LLM (i.e., BERT) to a lighter student LLM (i.e., DistilBERT) through sequential training adapted to MLTC that preserves the teacher's learned information while significantly reducing computational requirements. As a result, classification can be conducted locally, making the approach suitable for sensitive healthcare textual data and thereby supporting HIPAA compliance. Experiments conducted on three medical literature datasets of different sizes, sampled from the Hallmarks of Cancer (HoC) dataset, demonstrate that KDH-MLTC achieves superior performance compared to existing approaches, particularly on the largest dataset, reaching an F1 score of 82.70% ± 0.89%. Additionally, statistical validation and an ablation study are carried out, demonstrating the robustness of KDH-MLTC. Furthermore, the PSO-based hyperparameter optimization process allowed the identification of optimal configurations. The proposed approach contributes to healthcare text classification research, balancing efficiency requirements in resource-constrained healthcare settings with satisfactory accuracy demands.
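The teacher-to-student transfer described above is typically implemented as a combined loss: a hard term against the ground-truth multi-label targets plus a soft term against temperature-softened teacher probabilities. A minimal sketch for the multi-label (per-label sigmoid) setting follows; the function name, `alpha`, and `T` values are illustrative assumptions, not the paper's exact formulation or PSO-tuned hyperparameters:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def kd_mltc_loss(student_logits, teacher_logits, labels, alpha=0.5, T=2.0):
    """Distillation loss for multi-label classification.

    student_logits -- raw per-label scores from the student (e.g. DistilBERT)
    teacher_logits -- raw per-label scores from the teacher (e.g. BERT)
    labels         -- binary ground-truth vector, one entry per label
    alpha          -- weight on the hard (ground-truth) term
    T              -- temperature softening the teacher's probabilities
    """
    eps = 1e-12
    # hard term: binary cross-entropy against the true multi-label targets
    p = sigmoid(student_logits)
    hard = -np.mean(labels * np.log(p + eps) + (1 - labels) * np.log(1 - p + eps))
    # soft term: BCE between temperature-softened teacher and student probabilities;
    # the T^2 factor keeps gradient magnitudes comparable across temperatures
    ps = sigmoid(student_logits / T)
    pt = sigmoid(teacher_logits / T)
    soft = -np.mean(pt * np.log(ps + eps) + (1 - pt) * np.log(1 - ps + eps))
    return alpha * hard + (1 - alpha) * (T ** 2) * soft
```

The student is trained to minimize this combined objective, so it matches both the annotated labels and the teacher's softened per-label confidence scores.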