Robust and Decomposable Average Precision for Image Retrieval

Neural Information Processing Systems 

In image retrieval, standard evaluation metrics rely on score ranking, e.g.