Quark: Controllable Text Generation with Reinforced [ Un]learning Ximing Lu Sean Welleck Jack Hessel