HREF: Human Response-Guided Evaluation of Instruction Following in Language Models

Open in new window