Difference between revisions of "IBM Model 2"

From Cohen Courses
Jump to navigationJump to search
(Created page with 'One of the problems of the IBM Model 1 is that it is very weak to reordering, since <math>p(f,a|s)</math> is calculated using only the lexical translation probabilities <math…')
 
 
(2 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
== Citation ==
 +
 +
Brown, P. F., Pietra, V. J. D., Pietra, S. A. D., & Mercer, R. L. (1993). The mathematics of statistical machine translation: parameter estimation. Comput. Linguist., 19, 263–311.
 +
 +
== Online version ==
 +
 +
[http://dl.acm.org/ft_gateway.cfm?id=972474&type=pdf&CFID=49761657&CFTOKEN=94001682 pdf]
 +
 +
== Summary==
 +
IBM Model 2 is an extension to [IBM Model 1].
 +
 +
This model addressed the weak reordering properties of [[IBM Model 1]] by modeling the absolution distortion between the words in parallel sentence.
 +
 +
== Model ==
 +
 
One of the problems of the [[IBM Model 1]] is that it is very weak to reordering, since <math>p(f,a|s)</math> is calculated using only the lexical translation probabilities <math>tr(t|s)</math>. Because of this, if the model is presented with 2 translations candidates <math>t_1</math> and <math>t_2</math> with the same lexical translations, but with different reordering of the translated words, the model scores both translations with the same score.  
 
One of the problems of the [[IBM Model 1]] is that it is very weak to reordering, since <math>p(f,a|s)</math> is calculated using only the lexical translation probabilities <math>tr(t|s)</math>. Because of this, if the model is presented with 2 translations candidates <math>t_1</math> and <math>t_2</math> with the same lexical translations, but with different reordering of the translated words, the model scores both translations with the same score.  
  

Latest revision as of 23:29, 29 September 2011

Citation

Brown, P. F., Pietra, V. J. D., Pietra, S. A. D., & Mercer, R. L. (1993). The mathematics of statistical machine translation: parameter estimation. Comput. Linguist., 19, 263–311.

Online version

pdf

Summary

IBM Model 2 is an extension to [IBM Model 1].

This model addressed the weak reordering properties of IBM Model 1 by modeling the absolution distortion between the words in parallel sentence.

Model

One of the problems of the IBM Model 1 is that it is very weak to reordering, since is calculated using only the lexical translation probabilities . Because of this, if the model is presented with 2 translations candidates and with the same lexical translations, but with different reordering of the translated words, the model scores both translations with the same score.

Mixture-based Alignment models~(IBM Model 2) addresses this problem by modeling the absolute distortion in the word positioning between the 2 languages, introducing an alignment probability distribution , where and are the word positions in the source and target sentences. Thus the equation for becomes:

Where the alignment probability distribution models the probability of a word in the position in the source sentence of being reordered into the position in the target sentence.