Goto

Collaborating Authors

 Education


IDGen: ItemDiscriminationInduced PromptGenerationforLLMEvaluation

Neural Information Processing Systems

Item Discrimination (ID) theory, which is widely used in educational assessment, measures the ability of individual test items to differentiate between high and low performers. Inspired by this theory, wepropose anID-induced prompt synthesis frameworkforevaluating LLMs to ensure the evaluation set can continually update and refine according to model abilities.