K-ON: Stacking Knowledge On the Head Layer of Large Language Model

Open in new window