Large expert-curated database for benchmarking document similarity detection in biomedical literature search