| How Many Bits Are Needed To Store Probabilities for Phrase-Based Translation? (2006) | |||||||||||||||
Abstract | |||||||||||||||
| State of the art in statistical machine translation is currently represented by phrasebased models, which typically incorporate a large number of probabilities of phrase-pairs and word n-grams. In this work, we investigate data compression methods for efficiently encoding n-gram and phrase-pair probabilities, that are usually encoded in 32-bit floating point numbers. | |||||||||||||||
Publication details | |||||||||||||||
| |||||||||||||||