Difference between revisions of "Inside Outside algorithm"
m |
m |
||
Line 19: | Line 19: | ||
The inside probability can be calculated recursively with the following recurrence relation: | The inside probability can be calculated recursively with the following recurrence relation: | ||
− | <math>\alpha(A,i,j)=\sum_{B,C}\sum_{i\leq k\leq j}</math> | + | <math>\alpha(A,i,j)=\sum_{B,C}\sum_{i\leq k\leq j}p(A\rightarrow B C) \alpha(B,i,k) \alpha(C,k+1,j)</math> |
=== Outside counts === | === Outside counts === |
Revision as of 11:43, 29 November 2011
This is a Method page for the Inside-outside algorithm.
Background
The inside-outside algorithm is a way of estimating probabilities in a PCFG. It is first introduced [| Baker, 1979]. The inside outside algorithm is in fact a generalization of the forward-backward algorithm (for hidden Markov models) to PCFGs.
It is often used as part of the EM algorithm for computing expectations.
Algorithm
The algorithm is a dynamic programming algorithm that is often used with chart parsers to estimate expected production counts. Here, we assume the grammar is of Chomsky Normal Form.
The algorithm works by computing 2 probabilities for each nonterminal and span .
Inside probabilities
The inside probability is defined as , which is the probability of a nonterminal generating the word sequence to .
The inside probability can be calculated recursively with the following recurrence relation: