Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills

Open in new window