Mapping global dynamics of benchmark creation and saturation in artificial intelligence