Difference between revisions of "IBM Model 2"
(Created page with 'One of the problems of the IBM Model 1 is that it is very weak to reordering, since <math>p(f,a|s)</math> is calculated using only the lexical translation probabilities <math…') |
|||
(2 intermediate revisions by the same user not shown) | |||
Line 1: | Line 1: | ||
+ | == Citation == | ||
+ | |||
+ | Brown, P. F., Pietra, V. J. D., Pietra, S. A. D., & Mercer, R. L. (1993). The mathematics of statistical machine translation: parameter estimation. Comput. Linguist., 19, 263–311. | ||
+ | |||
+ | == Online version == | ||
+ | |||
+ | [http://dl.acm.org/ft_gateway.cfm?id=972474&type=pdf&CFID=49761657&CFTOKEN=94001682 pdf] | ||
+ | |||
+ | == Summary== | ||
+ | IBM Model 2 is an extension to [IBM Model 1]. | ||
+ | |||
+ | This model addressed the weak reordering properties of [[IBM Model 1]] by modeling the absolution distortion between the words in parallel sentence. | ||
+ | |||
+ | == Model == | ||
+ | |||
One of the problems of the [[IBM Model 1]] is that it is very weak to reordering, since <math>p(f,a|s)</math> is calculated using only the lexical translation probabilities <math>tr(t|s)</math>. Because of this, if the model is presented with 2 translations candidates <math>t_1</math> and <math>t_2</math> with the same lexical translations, but with different reordering of the translated words, the model scores both translations with the same score. | One of the problems of the [[IBM Model 1]] is that it is very weak to reordering, since <math>p(f,a|s)</math> is calculated using only the lexical translation probabilities <math>tr(t|s)</math>. Because of this, if the model is presented with 2 translations candidates <math>t_1</math> and <math>t_2</math> with the same lexical translations, but with different reordering of the translated words, the model scores both translations with the same score. | ||
Latest revision as of 23:29, 29 September 2011
Contents
Citation
Brown, P. F., Pietra, V. J. D., Pietra, S. A. D., & Mercer, R. L. (1993). The mathematics of statistical machine translation: parameter estimation. Comput. Linguist., 19, 263–311.
Online version
Summary
IBM Model 2 is an extension to [IBM Model 1].
This model addressed the weak reordering properties of IBM Model 1 by modeling the absolution distortion between the words in parallel sentence.
Model
One of the problems of the IBM Model 1 is that it is very weak to reordering, since is calculated using only the lexical translation probabilities . Because of this, if the model is presented with 2 translations candidates and with the same lexical translations, but with different reordering of the translated words, the model scores both translations with the same score.
Mixture-based Alignment models~(IBM Model 2) addresses this problem by modeling the absolute distortion in the word positioning between the 2 languages, introducing an alignment probability distribution , where and are the word positions in the source and target sentences. Thus the equation for becomes:
Where the alignment probability distribution models the probability of a word in the position in the source sentence of being reordered into the position in the target sentence.