Quark: Controllable Text Generation with Reinforced [Un]learning
–Neural Information Processing Systems
Generated text may contain offensive or toxic language, contain significant repetition, or be of a different sentiment than desired by the user. We consider the task of unlearning these misalignments by fine-tuning the language model on signals of what not to do.
Feb-11-2026, 09:47:26 GMT
- Country:
  - Asia
    - China > Hong Kong (0.04)
    - Middle East > Jordan (0.04)
  - Europe
  - North America > United States
    - California > San Diego County
      - San Diego (0.04)
    - Louisiana (0.04)
    - Maryland > Baltimore (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Texas > Travis County
      - Austin (0.04)
    - Washington > King County
      - Seattle (0.04)
  - Oceania > Australia
- Technology: