Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model

Open in new window