On Speeding Up Language Model Evaluation