High Recall Text Classification for Public Health Systematic Review

McNamee, Paul (Johns Hopkins University) | Mayfield, James (Johns Hopkins University) | Rowe, Samantha Y. (U.S. Centers for Disease Control and Prevention) | Rowe, Alexander K. (U.S. Centers for Disease Control and Prevention) | Jackson, Hannah L. (U.S. Centers for Disease Control and Prevention) | Baker, Megan (Johns Hopkins University)

AAAI Conferences 

Some information retrieval applications demand manageable levels of precision at high levels of recall. Examples include e-discovery, patent search, and systematic review. In this paper we present a real-world case study supporting a broad topic systematic review in the public health domain. We provide experimental results that demonstrate how retrieval performance on bibliographic citations can be materially improved. We attained an average precision of 0.57 and recall approaching 80% at a very reasonable screening depth. These results represent 18% and 23% relative gains over a baseline classifier. We also address pragmatic issues that arise when working on “noisy” real-world data, such as coping with citation records that often have empty fields.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found