On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting

Neural Information Processing Systems 

While in some applications the goal is to "nudge" the pre-trained

Similar Docs  Excel Report  more

TitleSimilaritySource
None found