Smoothing

From Cohen Courses
Revision as of 22:41, 30 March 2011 by Dwijaya (talk | contribs)

From Wikipedia:

In statistical language modeling, for example in a bag-of-words model, the data consist of the number of occurrences of each word in a document. Smoothing assigns non-zero probabilities to words that do not occur in the sample. From a Bayesian point of view, additive smoothing corresponds to taking the expected value of the posterior distribution over word probabilities, using a symmetric Dirichlet distribution with parameter α as the prior.
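
As an illustration of the idea above, here is a minimal sketch of additive (Laplace) smoothing, one common smoothing method. It estimates the probability of each word w as (count(w) + α) / (N + α·V), where N is the number of observed tokens and V is the vocabulary size. The function name `additive_smoothing`, the toy vocabulary, and the choice α = 1 are assumptions made for this example and do not come from the original page.

```python
from collections import Counter


def additive_smoothing(tokens, vocabulary, alpha=1.0):
    """Return additive-smoothed word probabilities over a fixed vocabulary.

    Hypothetical helper for illustration: each word receives a pseudocount
    of alpha, which matches the posterior mean under a symmetric
    Dirichlet(alpha) prior on the word distribution.
    """
    counts = Counter(tokens)
    # Only tokens that belong to the vocabulary contribute to the total.
    total = sum(counts[w] for w in vocabulary)
    denom = total + alpha * len(vocabulary)
    return {w: (counts[w] + alpha) / denom for w in vocabulary}


# Toy data (assumed): "dog" is in the vocabulary but never observed.
vocab = ["the", "cat", "sat", "dog"]
probs = additive_smoothing(["the", "cat", "sat", "the"], vocab, alpha=1.0)

for word in vocab:
    print(word, probs[word])
```

With these toy counts the unseen word "dog" still receives a non-zero probability, and the probabilities over the vocabulary sum to one. Larger values of α pull the estimates closer to the uniform distribution.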

External Link

Relevant Papers