A safety realignment framework via subspace-oriented model fusion for large language models

Open in new window