| Exploiting Syntactic, Semantic and Lexical Regularities in Language Modeling via Directed Markov Random Fields (2005) | |||||||||||||||
Abstract | |||||||||||||||
| We present a directed Markov random field (MRF) model that combines n-gram models, probahilistic context free grammars (l'C FGs) and probabilistic latent semantic analysis (PLSA) for the purpose of statistical language modeling. The composite directed MRF model has potentially exponential number of loops and be-comes context sensitive grammar, nevertheless we are able to esti-mate its parameters in cubic time using an efficient modified EM method, the generalized inside-outside algorithm, which extends inside-outside algorithm to incorporate the effects of the n-gram and PLSA language models. 1. | |||||||||||||||
Publication details | |||||||||||||||
| |||||||||||||||