Calibrating CNNs for Lifelong Learning
Neural Information Processing Systems
We present an approach for lifelong/continual learning of convolutional neural networks (CNNs) that does not suffer from catastrophic forgetting when moving from one task to the next. We show that the activation maps generated by a CNN trained on an old task can be calibrated, using very few calibration parameters, to become relevant to a new task. Based on this, we calibrate the activation maps produced by each network layer using spatial and channel-wise calibration modules, and train only these calibration parameters for each new task in order to perform lifelong learning. Our calibration modules introduce significantly fewer parameters and less computation than approaches that dynamically expand the network. Our approach is immune to catastrophic forgetting because we store the task-adaptive calibration parameters, which contain all the task-specific knowledge and are exclusive to each task.
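The storage scheme described in the abstract can be illustrated with a small sketch. The abstract does not give the exact form of the calibration modules, so everything here is an assumption made for illustration: the names `Calibration`, `calibrate`, and `CalibratedLayer` are hypothetical, the "spatial" step is reduced to an additive per-position offset, the "channel-wise" step to a per-channel multiplier, and the frozen base layer is a stand-in rather than a real convolution. The point it shows is only that keeping one set of calibration parameters per task, with shared weights frozen, leaves earlier tasks' outputs unchanged when a new task is added.

```python
from dataclasses import dataclass, field


@dataclass
class Calibration:
    """Hypothetical per-task calibration parameters for one layer."""
    spatial_offset: list  # [row][col], added to every channel (assumed form)
    channel_scale: list   # [channel], multiplies each channel (assumed form)


def calibrate(fmap, cal):
    """Apply the assumed spatial step, then the assumed channel-wise step.

    fmap is a nested list indexed as [channel][row][col].
    """
    return [
        [[(v + cal.spatial_offset[r][c]) * cal.channel_scale[k]
          for c, v in enumerate(row)]
         for r, row in enumerate(channel)]
        for k, channel in enumerate(fmap)
    ]


@dataclass
class CalibratedLayer:
    base_weights: tuple                           # shared, frozen after the first task
    per_task: dict = field(default_factory=dict)  # task id -> Calibration

    def base(self, x):
        # Stand-in for a frozen convolution: one weight per output channel, then ReLU.
        return [[[max(0.0, w * v) for v in row] for row in x]
                for w in self.base_weights]

    def forward(self, x, task_id):
        # Only the calibration looked up by task_id differs between tasks.
        return calibrate(self.base(x), self.per_task[task_id])


layer = CalibratedLayer(base_weights=(1.0, -2.0))
x = [[1.0, -1.0], [0.5, 2.0]]  # single-channel 2x2 input

# Task A: identity calibration (no offset, unit scale).
layer.per_task["task_a"] = Calibration(
    spatial_offset=[[0.0, 0.0], [0.0, 0.0]], channel_scale=[1.0, 1.0])
out_a_before = layer.forward(x, "task_a")

# Task B arrives later: only a new Calibration entry is added.
layer.per_task["task_b"] = Calibration(
    spatial_offset=[[0.1, 0.0], [0.0, 0.1]], channel_scale=[0.5, 2.0])
out_a_after = layer.forward(x, "task_a")
out_b = layer.forward(x, "task_b")

print(out_a_before == out_a_after)  # task A's output is unaffected by task B
print(out_b != out_a_after)         # task B sees differently calibrated activations
```

In this toy setup each new task adds one offset per spatial position and one multiplier per channel, while the shared weights are never modified; a real implementation would learn these parameters by gradient descent and would likely use a richer module than the one assumed here.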
Oct-11-2024, 03:47:16 GMT
- Genre:
- Instructional Material (0.64)
- Industry:
- Education > Educational Setting > Continuing Education (0.64)
- Technology: