Learning to summarize from human feedback Nisan Stiennon Long Ouyang Jeff Wu
–Neural Information Processing Systems
As language models become more powerful, training and evaluation are increasingly bottlenecked by the data and metrics used for a particular task.
Neural Information Processing Systems
Oct-2-2025, 09:57:04 GMT