Prediction-Constrained Topic Models for Antidepressant Recommendation
Hughes, Michael C., Hope, Gabriel, Weiner, Leah, McCoy, Thomas H., Perlis, Roy H., Sudderth, Erik B., Doshi-Velez, Finale
Supervisory signals can help topic models discover low-dimensional data representations that are more interpretable for clinical tasks. We propose a framework for training supervised latent Dirichlet allocation that balances two goals: faithful generative explanations of high-dimensional data and accurate prediction of associated class labels. Existing approaches fail to balance these goals by not properly handling a fundamental asymmetry: the intended task is always predicting labels from data, not data from labels. Our new prediction-constrained objective trains models that predict labels from heldout data well while also producing good generative likelihoods and interpretable topic-word parameters. In a case study on predicting depression medications from electronic health records, we demonstrate improved recommendations compared to previous supervised topic models and high- dimensional logistic regression from words alone.
Dec-1-2017
- Country:
- North America > United States (0.46)
- Genre:
- Research Report > New Finding (0.66)
- Industry:
- Health & Medicine
- Pharmaceuticals & Biotechnology (1.00)
- Health Care Providers & Services (0.93)
- Health Care Technology > Medical Record (0.69)
- Therapeutic Area
- Psychiatry/Psychology (1.00)
- Immunology (1.00)
- Infections and Infectious Diseases (0.94)
- Vaccines (0.69)
- Health & Medicine