Factorial LDA: Sparse Multi-Dimensional Text Models

Dec-31-2012–Neural Information Processing Systems

Latent variable models can be enriched with a multidimensional structure to consider the many latent factors in a text corpus, such as topic, author perspective and sentiment. We introduce factorial LDA, a multidimensional model in which a document is influenced by K different factors, and each word token depends on a K-dimensional vector of latent variables. Our model incorporates structured word priors and learns a sparse product of factors. Experiments on research abstracts show that our model can learn latent factors such as research topic, scientific discipline, and focus (methods vs. applications). Our modeling improvements reduce test perplexity and improve human interpretability of the discovered factors.
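The idea that each word token depends on a K-dimensional vector of latent variables can be illustrated with a toy generative sketch. This is not the paper's model or inference procedure: the function name `sample_document`, the random weights, and the example vocabulary are all hypothetical, and the structured word priors and sparsity described in the abstract are left out. It only shows a token being paired with one value per factor before a word is drawn.

```python
import itertools
import random


def sample_document(num_tokens, factor_sizes, vocab, seed=0):
    """Toy sampler: each token gets a K-dimensional tuple of latent values.

    factor_sizes has one entry per factor (K = len(factor_sizes)).
    All weights below are random placeholders, not learned parameters.
    """
    rng = random.Random(seed)

    # Enumerate every combination of factor values,
    # e.g. (topic, discipline, focus) when K = 3.
    tuples = list(itertools.product(*(range(n) for n in factor_sizes)))

    # Hypothetical document-level weights over tuples (unnormalized).
    tuple_weights = [rng.random() for _ in tuples]

    # Hypothetical per-tuple word weights (unnormalized); a real model
    # would learn these instead of drawing them at random.
    word_weights = {t: [rng.random() for _ in vocab] for t in tuples}

    tokens = []
    for _ in range(num_tokens):
        # Draw the K-dimensional latent tuple for this token.
        z = rng.choices(tuples, weights=tuple_weights, k=1)[0]
        # Draw the word conditioned on the whole tuple.
        w = rng.choices(vocab, weights=word_weights[z], k=1)[0]
        tokens.append((z, w))
    return tokens
```

Calling `sample_document(5, [2, 3, 2], ["model", "gene", "proof"])` returns five `(tuple, word)` pairs, where each tuple has three entries, one per factor. The number of tuples grows as the product of the factor sizes, which is one reason a sparse product of factors, as the abstract mentions, is attractive.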

machine learning, natural language, tuple

Country:
- North America > United States (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Learning Graphical Models (0.88)
  - Natural Language > Text Processing (0.87)
