Basis
The idea behind statistical machine translation comes from information theory. A document is translated according to the probability distribution that a string in the target language (for example, English) is the translation of a string in the source language (for example, French).
The problem of modeling the probability distribution has been approached in a number of ways. One intuitive approach is to apply Bayes Theorem, that is, where the translation model is the probability that the source string is the translation of the target string, and the language model is the probability of seeing that target language string. This decomposition is attractive as it splits the problem into two subproblems. Finding the best translation is done by picking up the one that gives the highest probability:
- .
For a rigorous implementation of this one would have to perform an exhaustive search by going through all strings in the native language. Performing the search efficiently is the work of a machine translation decoder that uses the foreign string, heuristics and other methods to limit the search space and at the same time keeping acceptable quality. This trade-off between quality and time usage can also be found in speech recognition.
As the translation systems are not able to store all native strings and their translations, a document is typically translated sentence by sentence, but even this is not enough. Language models are typically approximated by smoothed n-gram models, and similar approaches have been applied to translation models, but there is additional complexity due to different sentence lengths and word orders in the languages.
The statistical translation models were initially word based (Models 1-5 from IBM Hidden Markov model from Stephan Vogel and Model 6 from Franz-Joseph Och), but significant advances were made with the introduction of phrase based models. Recent work has incorporated syntax or quasi-syntactic structures.
Read more about this topic: Statistical Machine Translation
Famous quotes containing the word basis:
“Adolescents, for all their self-involvement, are emerging from the self-centeredness of childhood. Their perception of other people has more depth. They are better equipped at appreciating others reasons for action, or the basis of others emotions. But this maturity functions in a piecemeal fashion. They show more understanding of their friends, but not of their teachers.”
—Terri Apter (20th century)
“The terrors of the child are quite reasonable, and add to his loveliness; for his utter ignorance and weakness, and his enchanting indignation on such a small basis of capital compel every bystander to take his part.”
—Ralph Waldo Emerson (18031882)
“The self ... might be regarded as a sort of citadel of the mind, fortified without and containing selected treasures within, while love is an undivided share in the rest of the universe. In a healthy mind each contributes to the growth of the other: what we love intensely or for a long time we are likely to bring within the citadel, and to assert as part of ourself. On the other hand, it is only on the basis of a substantial self that a person is capable of progressive sympathy or love.”
—Charles Horton Cooley (18641929)