Efficient Lifelong Model Evaluation in an Era of Rapid Progress