Unlearning as multi-task optimization: A normalized gradient difference approach with an adaptive learning rate