Learning Supervised Topic Models from Crowds

Rodrigues, Filipe (University of Coimbra) | Ribeiro, Bernardete (University of Coimbra) | Lourenço, Mariana (University of Coimbra) | Pereira, Francisco (Massachusetts Institute of Technology)

Nov-1-2015–AAAI Conferences

The growing need to analyze large collections of documents has led to great developments in topic modeling. Since documents are frequently associated with other related variables, such as labels or ratings, much interest has been placed on supervised topic models. However, the nature of most annotation tasks, which are prone to ambiguity and noise and often involve high volumes of documents, makes learning under a single-annotator assumption unrealistic or impractical for most real-world applications. In this paper, we propose a supervised topic model that accounts for the heterogeneity and biases among different annotators that are encountered in practice when learning from crowds. We develop an efficient stochastic variational inference algorithm that is able to scale to very large datasets, and we empirically demonstrate the advantages of the proposed model over state-of-the-art approaches.
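
To make the notion of annotator heterogeneity and bias concrete, the following is a minimal sketch and not the paper's model. It assumes each annotator can be described by a per-annotator confusion matrix, a common parameterization in the learning-from-crowds literature that the abstract itself does not specify. All names (`simulate_crowd_labels`, `majority_vote`, `reliable`, `biased`) and all probability values are hypothetical, chosen only for illustration. The sketch simulates one reliable and two biased annotators and compares each annotator, and a simple majority vote, against the true labels.

```python
import numpy as np


def simulate_crowd_labels(true_labels, confusion_matrices, rng):
    """Draw one noisy label per (document, annotator) pair.

    Assumption: confusion_matrices[r][c, l] is the probability that
    annotator r reports label l when the true class is c.
    """
    n_docs = len(true_labels)
    n_classes = confusion_matrices[0].shape[0]
    noisy = np.empty((n_docs, len(confusion_matrices)), dtype=int)
    for r, pi in enumerate(confusion_matrices):
        for d, c in enumerate(true_labels):
            noisy[d, r] = rng.choice(n_classes, p=pi[c])
    return noisy


def majority_vote(noisy, n_classes):
    """Aggregate labels per document by simple voting (ties go to the lowest class)."""
    counts = np.stack([(noisy == k).sum(axis=1) for k in range(n_classes)], axis=1)
    return counts.argmax(axis=1)


rng = np.random.default_rng(0)
true_labels = rng.integers(0, 2, size=1000)

# Hypothetical annotators: one reliable, two biased toward answering class 0.
reliable = np.array([[0.95, 0.05],
                     [0.05, 0.95]])
biased = np.array([[0.99, 0.01],
                   [0.60, 0.40]])
confusion_matrices = [reliable, biased, biased]

noisy = simulate_crowd_labels(true_labels, confusion_matrices, rng)
accuracy_per_annotator = (noisy == true_labels[:, None]).mean(axis=0)
mv_accuracy = (majority_vote(noisy, 2) == true_labels).mean()

print("per-annotator accuracy:", accuracy_per_annotator)
print("majority-vote accuracy:", mv_accuracy)
```

In this toy setting the biased annotators outvote the reliable one, so treating all annotators as interchangeable loses information that a model with annotator-specific parameters could in principle recover.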

annotator, machine learning, natural language

Country:
- North America > United States
  - Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > Portugal
  - Coimbra > Coimbra (0.04)
- Asia > Middle East
  - Jordan (0.05)

Genre:
- Research Report (0.34)
- Overview (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Statistical Learning (1.00)
  - Natural Language > Discourse & Dialogue (0.92)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.46)
