Using Vocabulary Knowledge in Bayesian Multinomial Estimation

Griffiths, Thomas L., Tenenbaum, Joshua B.

Dec-31-2002–Neural Information Processing Systems

Recent approaches have used uncertainty over the vocabulary of symbols in a multinomial distribution as a means of accounting for sparsity. We present a Bayesian approach that allows weak prior knowledge, in the form of a small set of approximate candidate vocabularies, to be used to dramatically improve the resulting estimates. We demonstrate these improvements in applications to text compression and estimating distributions over words in newsgroup data.

1 Introduction

Sparse multinomial distributions arise in many statistical domains, including natural language processing and graphical models. Consequently, a number of approaches to parameter estimation for sparse multinomial distributions have been suggested [3]. These approaches tend to be domain-independent: they make little use of prior knowledge about a specific domain.
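As a rough illustration of the general idea of averaging a multinomial estimate over a few candidate vocabularies, the following is a minimal sketch and not the authors' estimator. It assumes a symmetric Dirichlet prior with parameter `alpha`, a uniform prior over the candidates, and strict vocabularies (a candidate that fails to cover an observed symbol gets zero weight), whereas the abstract describes approximate vocabularies. All function names are hypothetical.

```python
from collections import Counter
from math import exp, inf, lgamma


def log_marginal(counts, vocab, alpha=1.0):
    """Log Dirichlet-multinomial marginal likelihood of the observed sequence.

    Assumption: the vocabulary is strict, so any observed symbol outside
    `vocab` gives the data zero probability under this candidate.
    """
    if any(symbol not in vocab for symbol in counts):
        return -inf
    k = len(vocab)
    n = sum(counts.values())
    total = lgamma(k * alpha) - lgamma(k * alpha + n)
    for c in counts.values():
        # Unobserved symbols contribute lgamma(alpha) - lgamma(alpha) = 0.
        total += lgamma(alpha + c) - lgamma(alpha)
    return total


def vocabulary_posterior(counts, vocabs, alpha=1.0):
    """Posterior over candidate vocabularies under a uniform prior (assumed)."""
    logs = [log_marginal(counts, v, alpha) for v in vocabs]
    top = max(logs)
    if top == -inf:
        raise ValueError("no candidate vocabulary covers the observed symbols")
    # Subtract the maximum before exponentiating for numerical stability.
    weights = [exp(value - top) for value in logs]
    norm = sum(weights)
    return [w / norm for w in weights]


def predictive(symbol, counts, vocabs, alpha=1.0):
    """Probability of the next symbol, averaged over candidate vocabularies."""
    posterior = vocabulary_posterior(counts, vocabs, alpha)
    n = sum(counts.values())
    prob = 0.0
    for weight, vocab in zip(posterior, vocabs):
        if symbol in vocab:
            smoothed = (counts.get(symbol, 0) + alpha) / (n + len(vocab) * alpha)
            prob += weight * smoothed
    return prob


counts = Counter("abab")
vocabs = [set("ab"), set("abcd")]
posterior = vocabulary_posterior(counts, vocabs)
```

With these toy counts, the smaller candidate vocabulary receives most of the posterior weight, so less probability mass is spread over symbols that were never observed. This is the kind of effect that vocabulary knowledge is meant to provide for sparse data.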

artificial intelligence, bayesian inference, knowledge

Country:
- North America > United States > California > Santa Clara County (0.14)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Learning Graphical Models
    - Directed Networks > Bayesian Learning (0.90)
  - Natural Language (1.00)
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (1.00)
