PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models
