Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking