Learning to summarize from human feedback Nisan Stiennon Long Ouyang Jeff Wu

Neural Information Processing Systems 

As language models become more powerful, training and evaluation are increasingly bottlenecked by the data and metrics used for a particular task.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found