Goto

Collaborating Authors

 Reinforcement Learning




Non-MarkovianRewardModellingfromTrajectory LabelsviaInterpretableMultipleInstanceLearning

Neural Information Processing Systems

There is growing consensus around the view that aligned and beneficial AI requires a reframing of objectives as being contingent, uncertain, and learnable via interaction with humans [35].







Quark: ControllableTextGeneration with Reinforced[ Un]learning

Neural Information Processing Systems

Generated text may contain offensive or toxic language, contain significant repetition, orbeofadifferent sentiment than desired by the user. We consider thetaskofunlearningthese misalignments byfine-tuning thelanguage model on signals of whatnot to do.