Classification of cancer pathology reports: a large-scale comparative study