GateON: an unsupervised method for large scale continual learning

Barry, Martin, Bellec, Guillaume, Gerstner, Wulfram

Jun-2-2023–arXiv.org Artificial Intelligence

The objective of continual learning (CL) is to learn tasks sequentially without retraining on earlier tasks. However, when subjected to CL, traditional neural networks exhibit catastrophic forgetting and limited generalization. To overcome these problems, we introduce a novel method called 'Gate and Obstruct Network' (GateON). GateON combines learnable gating of activity and online estimation of parameter relevance to safeguard crucial knowledge from being overwritten. Our method generates partially overlapping pathways between tasks which permits forward and backward transfer during sequential learning. GateON addresses the issue of network saturation after parameter fixation by a re-activation mechanism of fixed neurons, enabling large-scale continual learning. GateON is implemented on a wide range of networks (fully-connected, CNN, Transformers), has low computational complexity, effectively learns up to 100 MNIST learning tasks, and achieves top-tier results for pre-trained BERT in CL-based NLP tasks.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

Jun-2-2023

arXiv.org PDF

Add feedback

Country:
- Oceania > New Zealand (0.04)
- North America > United States
  - Tennessee > Knox County
    - Knoxville (0.04)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)
  - Massachusetts
    - Suffolk County > Boston (0.04)
    - Middlesex County > Cambridge (0.04)
- Europe
  - Austria (0.04)
  - Switzerland > Vaud
    - Lausanne (0.04)
  - Belgium > Flanders
    - East Flanders > Ghent (0.04)

Genre:
- Research Report > New Finding (0.92)

Industry:
- Education (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Cognitive Science (0.93)
  - Machine Learning > Neural Networks
    - Deep Learning (0.93)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found