7428e6db752171d6b832c53b2ed297ab-Paper-Conference.pdf

Neural Information Processing Systems 

First, we formalize the problem definition.Weintroducetheconceptof"Idon'tknow (idk) responses" and in this context, honesty necessitates that an aligned LLM provides idk responses for unknown questions and correct responses for known questions.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found