Training language models to follow instructions with human feedback Long Ouyang Jeff Wu

Open in new window