Seeing The Whole Patient: Using Multi-Label Medical Text Classification Techniques to Enhance Predictions of Medical Codes

Yogarajan, Vithya, Montiel, Jacob, Smith, Tony, Pfahringer, Bernhard

Mar-28-2020–arXiv.org Machine Learning

Machine learning-based multi-label medical text classifications can be used to enhance the understanding of the human body and aid the need for patient care. We present a broad study on clinical natural language processing techniques to maximise a feature representing text when predicting medical codes on patients with multi-morbidity. We present results of multi-label medical text classification problems with 18, 50 and 155 labels. We compare several variations to embeddings, text tagging, and pre-processing. For imbalanced data we show that labels which occur infrequently, benefit the most from additional features incorporated in embeddings. We also show that high dimensional embeddings pre-trained using health-related data present a significant improvement in a multi-label setting, similarly to the way they improve performance for binary classification. High dimensional embeddings from this research are made available for public use.

discharge summary, f-measure, icd-9 code, (16 more...)

arXiv.org Machine Learning

Mar-28-2020

arXiv.org PDF

Add feedback

Country:
- Oceania > New Zealand
  - North Island > Waikato (0.04)
- North America
  - United States (0.04)
  - Canada > Ontario (0.04)
- Asia
  - Middle East > Israel (0.04)
  - Japan (0.04)

Genre:
- Research Report > New Finding (1.00)
- Instructional Material (0.99)

Industry:
- Health & Medicine
  - Therapeutic Area > Oncology (1.00)
  - Health Care Providers & Services (1.00)
  - Health Care Technology > Medical Record (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Text Processing (0.93)
    - Text Classification (0.92)
  - Machine Learning
    - Statistical Learning (1.00)
    - Neural Networks > Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found