Goto

Collaborating Authors

 Education






7428e6db752171d6b832c53b2ed297ab-Paper-Conference.pdf

Neural Information Processing Systems

First, we formalize the problem definition.Weintroducetheconceptof"Idon'tknow (idk) responses" and in this context, honesty necessitates that an aligned LLM provides idk responses for unknown questions and correct responses for known questions.