Experimenting with Drugs (and Topic Models): Multi-Dimensional Exploration of Recreational Drug Discussions
Paul, Michael J. (Johns Hopkins University) | Dredze, Mark (Johns Hopkins University)
Clinical research of new recreational drugs and trends requires mining current information from non-traditional text sources. In this work we support such research through the use of multi-dimensional latent text models, such as factorial LDA, that capture orthogonal factors of corpora, creating structured output for researchers to better understand the contents of a corpus. Since a purely unsupervised model is unlikely to discover specific factors of interests to clinical researchers, we modify the structure of factorial LDA to incorporate prior knowledge, including the use of of observed variables, informative priors and background components. The resulting model learns factors that correspond to drug type, delivery method (smoking, injection, etc.), and aspect (chemistry, culture, effects, health, usage). We demonstrate that the improved model yields better quantitative and more interpretable results.
Nov-5-2012
- Country:
- North America > United States (0.28)
- Genre:
- Research Report
- Experimental Study (0.48)
- New Finding (0.48)
- Research Report
- Technology: