Auto-ARGUE: LLM-Based Report Generation Evaluation