Discriminative Learning for Label Sequences via Boosting

Altun, Yasemin, Hofmann, Thomas, Johnson, Mark

Dec-31-2003–Neural Information Processing Systems

Well-known applications include part-of-speech (POS) tagging, named entity classification, information extraction,text segmentation and phoneme classification in text and speech processing [7] as well as problems like protein homology detection, secondary structure prediction or gene classification in computational biology [3]. Up to now, the predominant formalism for modeling and predicting label sequences has been based on Hidden Markov Models (HMMs) and variations thereof. Yet, despite its success, generative probabilistic models - of which HMMs are a special case - have two major shortcomings, which this paper is not the first one to point out. First, generative probabilistic models are typically trained using maximum likelihood estimation (MLE) for a joint sampling model of observation and label sequences. As has been emphasized frequently, MLE based on the joint probability model is inherently non-discriminative and thus may lead to suboptimal prediction accuracy. Secondly, efficient inference and learning in this setting often requires to make questionable conditional independence assumptions.

artificial intelligence, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Dec-31-2003

Conferences PDF

Add feedback

Country:
- North America > United States > California > San Francisco County > San Francisco (0.15)

Industry:
- Health & Medicine > Pharmaceuticals & Biotechnology (0.49)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.70)
  - Machine Learning > Learning Graphical Models
    - Undirected Networks > Markov Models (0.88)
    - Directed Networks > Bayesian Learning (0.70)

Duplicate Docs Excel Report

Title
Discriminative Learning for Label Sequences via Boosting
Discriminative Learning for Label Sequences via Boosting

Similar Docs Excel Report more

Title	Similarity	Source
None found