Benchmarks as Microscopes: A Call for Model Metrology