Learning to summarize from human feedback Nisan Stiennon Long Ouyang Jeff Wu

Open in new window