Goto

Collaborating Authors

 Machine Translation




Quark: ControllableTextGeneration with Reinforced[ Un]learning

Neural Information Processing Systems

Generated text may contain offensive or toxic language, contain significant repetition, orbeofadifferent sentiment than desired by the user. We consider thetaskofunlearningthese misalignments byfine-tuning thelanguage model on signals of whatnot to do.


3979818cdc7bc8dbeec87170c11ee340-Paper-Conference.pdf

Neural Information Processing Systems

Self-supervised large language models have demonstrated the ability to perform various tasks via in-context learning, but little is known about where the model locates the task with respect to prompt instructions and demonstration examples. In this work, we attempt to characterize the region where large language models transition from recognizing the task to performing the task.