Large expert-curated database for benchmarking document similarity detection in biomedical literature search
Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations.
Nov-23-2019, 10:46:43 GMT
- Country:
- Africa
- South Africa > Gauteng (0.14)
- Ethiopia (0.14)
- Niger (0.14)
- Sudan (0.14)
- Middle East > Egypt (0.67)
- Botswana (0.14)
- The Gambia (0.14)
- Nigeria (0.67)
- Mozambique (0.14)
- Malawi (0.14)
- Senegal (0.14)
- Cameroon (0.14)
- Asia
- Pakistan (0.14)
- Nepal (0.14)
- Malaysia (0.14)
- Vietnam (0.14)
- Japan
- Hokkaidō (0.14)
- Honshū
- Chūbu > Aichi Prefecture (0.14)
- Kansai (0.45)
- Kantō
- Kanagawa Prefecture (0.14)
- Tokyo Metropolis Prefecture > Tokyo (0.14)
- Tōhoku (0.28)
- Kyūshū & Okinawa > Kyūshū (0.14)
- Indonesia (0.14)
- Middle East
- Iran (0.46)
- Israel > Mediterranean Sea (0.34)
- Lebanon (0.14)
- Qatar (0.14)
- Republic of Türkiye (0.67)
- Saudi Arabia > Eastern Province
- Al-Ahsa Governorate > Al-Hofuf (0.14)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.14)
- Russia (0.14)
- China
- Fujian Province (0.14)
- Anhui Province (0.14)
- Hubei Province (0.14)
- Sichuan Province (0.14)
- Guangdong Province (0.47)
- Henan Province (0.14)
- Heilongjiang Province (0.14)
- Shandong Province (0.14)
- Hunan Province (0.14)
- Liaoning Province (0.14)
- Zhejiang Province (0.14)
- Jiangsu Province (0.14)
- Guizhou Province (0.14)
- South Korea (0.46)
- Taiwan (0.28)
- Sri Lanka (0.14)
- India > NCT (0.14)
- Europe
- Hungary (0.14)
- United Kingdom
- England
- Greater Manchester (0.28)
- Leicestershire (0.14)
- Devon (0.14)
- Greater London > London (0.15)
- Oxfordshire > Oxford (0.14)
- Merseyside > Liverpool (0.14)
- Nottinghamshire > Nottingham (0.14)
- Dorset (0.14)
- Cambridgeshire > Cambridge (0.14)
- West Midlands (0.14)
- Scotland (0.14)
- Wales (0.27)
- England
- Ireland > Leinster
- County Dublin > Dublin (0.14)
- Czechia (0.28)
- Sweden
- Skåne County (0.28)
- Vaestra Goetaland > Gothenburg (0.14)
- Ukraine (0.27)
- Russia > Northwestern Federal District
- Leningrad Oblast > Saint Petersburg (0.14)
- Romania (0.14)
- Belgium > Flanders (0.28)
- Greece (0.46)
- Middle East > Cyprus (0.14)
- Italy
- Campania (0.14)
- Piedmont > Turin Province
- Turin (0.14)
- France
- Auvergne-Rhône-Alpes (0.14)
- Bourgogne-Franche-Comté (0.27)
- Occitanie (0.14)
- Provence-Alpes-Côte d'Azur (0.14)
- Serbia (0.14)
- Portugal > Lisbon
- Lisbon (0.14)
- Norway (0.68)
- Finland (0.93)
- Switzerland
- Denmark > Capital Region
- Copenhagen (0.14)
- Iceland (0.14)
- Netherlands
- Gelderland (0.14)
- South Holland (0.28)
- Spain
- Germany
- Baden-Württemberg (0.46)
- Bavaria > Upper Bavaria (0.14)
- Hesse > Darmstadt Region
- Frankfurt (0.14)
- Lower Saxony > Gottingen (0.14)
- North Rhine-Westphalia
- Cologne Region (0.28)
- Düsseldorf Region > Düsseldorf (0.14)
- Münster Region > Münster (0.14)
- Saarland (0.14)
- Schleswig-Holstein (0.14)
- Poland (1.00)
- Liechtenstein (0.14)
- Bulgaria (0.14)
- Austria > Vienna (0.14)
- North America
- Bermuda (0.14)
- Canada
- Alberta > Census Division No. 2
- Lethbridge County > Lethbridge (0.14)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.14)
- Manitoba (0.14)
- Nova Scotia > Halifax Regional Municipality
- Dartmouth (0.14)
- Ontario
- Hamilton (0.14)
- National Capital Region > Ottawa (0.28)
- Toronto (0.14)
- Quebec > Montreal (0.14)
- Saskatchewan (0.27)
- Alberta > Census Division No. 2
- Mexico (0.14)
- Puerto Rico (0.14)
- United States
- Colorado > Boulder County
- Boulder (0.14)
- California
- Los Angeles County > Los Angeles (0.28)
- Merced County > Merced (0.14)
- Orange County > Irvine (0.14)
- Riverside County > Riverside (0.14)
- San Diego County (0.14)
- San Francisco County > San Francisco (0.28)
- Santa Clara County (0.14)
- Massachusetts
- Hampshire County > Amherst (0.14)
- Middlesex County (0.14)
- Vermont > Chittenden County
- Burlington (0.14)
- Mississippi (0.28)
- Michigan > Ingham County (0.14)
- Washington > King County
- Seattle (0.14)
- Virginia
- Albemarle County > Charlottesville (0.14)
- Fairfax County (0.14)
- Connecticut > Tolland County
- Storrs (0.14)
- Nebraska
- Douglas County > Omaha (0.14)
- Lancaster County > Lincoln (0.14)
- Alabama > Tuscaloosa County
- Tuscaloosa (0.14)
- Ohio (0.28)
- New Mexico (0.14)
- Tennessee (0.14)
- Iowa > Johnson County
- Iowa City (0.14)
- Illinois > Cook County (0.14)
- Kentucky > Fayette County
- Lexington (0.14)
- North Carolina (0.46)
- Pennsylvania > Philadelphia County
- Philadelphia (0.14)
- Kansas (0.28)
- Maryland
- Baltimore (0.14)
- Montgomery County (0.14)
- Oregon (0.14)
- New York
- Albany County > Albany (0.14)
- New York County > New York City (0.14)
- Wisconsin
- Dane County > Madison (0.14)
- Milwaukee County > Wauwatosa (0.14)
- Missouri
- Boone County > Columbia (0.14)
- Jackson County > Kansas City (0.14)
- St. Louis County > St. Louis (0.14)
- Arizona > Pima County
- Tucson (0.14)
- Montana > Missoula County
- Missoula (0.14)
- Georgia > Clarke County
- Athens (0.14)
- Indiana > Tippecanoe County (0.14)
- Texas > Dallas County (0.14)
- Florida
- Alachua County > Gainesville (0.14)
- Hillsborough County > Tampa (0.14)
- Arkansas (0.14)
- Minnesota > Hennepin County
- Minneapolis (0.27)
- Colorado > Boulder County
- Oceania
- Australia
- New South Wales (0.28)
- Queensland (0.29)
- South Australia (0.14)
- Tasmania > Hobart (0.14)
- Western Australia (0.14)
- New Caledonia (0.14)
- New Zealand
- North Island (0.14)
- South Island (0.14)
- Australia
- South America
- Argentina (0.14)
- Brazil
- Bahia (0.14)
- Goiás (0.14)
- Santa Catarina (0.14)
- Chile (0.28)
- Colombia > Bogotá D.C. (0.14)
- Peru (0.14)
- Africa
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Research Report
- Industry:
- Education > Educational Setting
- Higher Education (1.00)
- Energy > Oil & Gas (0.92)
- Food & Agriculture > Agriculture (1.00)
- Government > Regional Government
- Health & Medicine
- Diagnostic Medicine > Imaging (0.93)
- Pharmaceuticals & Biotechnology (1.00)
- Nuclear Medicine (1.00)
- Public Health (1.00)
- Epidemiology (0.93)
- Therapeutic Area
- Cardiology/Vascular Diseases (1.00)
- Immunology (1.00)
- Infections and Infectious Diseases (1.00)
- Neurology (1.00)
- Oncology (1.00)
- Pediatrics/Neonatology (0.93)
- Psychiatry/Psychology (1.00)
- Health Care Providers & Services (1.00)
- Surgery (1.00)
- Consumer Health (1.00)
- Education > Educational Setting
- Technology: