Learning to summarize from human feedback

Open in new window