Detecting (Un)answerability in Large Language Models with Linear Directions

Open in new window