NoveltyBench: Evaluating Language Models for Humanlike Diversity

Open in new window