Active Evaluation Acquisition for Efficient LLM Benchmarking