Statistical Machine Translation
Cambridge University Press, 2010 - 433 pages

The dream of automatic language translation is now closer thanks to recent advances in the techniques that underpin statistical machine translation. This class-tested textbook, from an active researcher in the field, provides a clear and careful introduction to the latest methods and explains how to build machine translation systems for any two languages. It introduces the subject's building blocks from linguistics and probability, then covers the major models for machine translation: word-based, phrase-based, and tree-based, as well as machine translation evaluation, language modeling, discriminative training, and advanced methods to integrate linguistic annotation. The book also reports the latest research, presents the major outstanding challenges, and enables novices as well as experienced researchers to make novel contributions to this exciting area. It is ideal for students at undergraduate and graduate level, or for anyone interested in the latest developments in machine translation.
Common terms and phrases
algorithm alignment points Association for Computational back-off best translation bigrams BLEU score candidate translations Chapter chart entries Computational Linguistics Conference counts decoding English words Europarl Europarl corpus example extracted Figure finite state transducer foreign word Frank Gehry grammar rules Haus human translators hypothesis IBM Model input sentence input words instance IWSLT Koehn language model lexical translation log-linear model machine learning machine translation models machine translation systems mapping match methods morphological n-best list n-gram Natural Language Processing node nonterminal noun phrase order n-gram output words parallel corpora parallel corpus parameter parse part-of-speech tags phrase pairs phrase-based models possible probability distribution problem Proceedings pruning reference translation reordering model sentence pair sequence span Spoken Language Translation stack statistical machine translation structure synchronous grammar tion training data translation options translation probability transliteration tree verb word alignment word error rate word-based Workshop on Spoken Workshop on Statistical Zhang