Hierarchically Supervised Latent Dirichlet Allocation
–Neural Information Processing Systems
We introduce hierarchically supervised latent Dirichlet allocation (HSLDA), a model for hierarchically and multiply labeled bag-of-word data. Examples of such data include web pages and their placement in directories, product descriptions and associated categories from product hierarchies, and free-text clinical records and their assigned diagnosis codes. Out-of-sample label prediction is the primary goal of this work, but improved lower-dimensional representations of the bagof-word data are also of interest. We demonstrate HSLDA on large-scale data from clinical document labeling and retail product categorization tasks. We show that leveraging the structure from hierarchical labels improves out-of-sample label prediction substantially when compared to models that do not.
Neural Information Processing Systems
Mar-15-2024, 01:44:47 GMT
- Country:
- North America > United States
- New York > New York County
- New York City (0.04)
- California > Santa Clara County
- Palo Alto (0.04)
- New York > New York County
- Asia > Middle East
- Jordan (0.05)
- North America > United States
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Technology: