PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models