Training language models to follow instructions with human feedback
Long Ouyang, Jeff Wu

Neural Information Processing Systems 

In other words, these models are not aligned with their users.
