We present an iterative procedure to build a Chinese language model (LM). We segment Chinese text into words based on a word-based Chinese language model. However, the construction of a Chinese LM...
How to get a Chinese Name (Entity): Segmentation and Combination Issues (2007)
Hongyan Jing, Radu Florian, Xiaoqiang Luo, Tong Zhang, Abraham Ittycheriah
When building a Chinese named entity recognition system, one must deal with certain language-specific issues such as whether the model should be based on characters or words. While there is no unique...
The Impact of Morphological Stemming on Arabic Mention Detection and Coreference Resolution (2005)
Imed Zitouni, Jeff Sorensen, Xiaoqiang Luo, Radu Florian
Arabic presents an interesting challenge to natural language processing, being a highly inflected and agglutinative language. In particular, this paper presents an in-depth investigation of the...
On coreference resolution performance metrics (2005)
The paper proposes a Constrained Entity-Alignment F-Measure (CEAF) for evaluating coreference resolution. The metric is computed by aligning reference and system entities (or coreference chains) with...
A Mention-Synchronous Coreference Resolution Algorithm Based on the Bell Tree (2004)
Xiaoqiang Luo, Abe Ittycheriah, Hongyan Jing, Nanda Kambhatla, Salim Roukos
This paper proposes a new approach for coreference resolution which uses the Bell tree to represent the search space and casts the coreference resolution problem as finding the best path from the...
How to get a Chinese Name (Entity): Segmentation and combination issues (2003)
Hongyan Jing, Radu Florian, Xiaoqiang Luo, Tong Zhang, Abraham Ittycheriah
When building a Chinese named entity recognition system, one must deal with certain language-specific issues such as whether the model should be based on characters...
Speaker normalization with all-pass transforms (1998)
John McDonough, William Byrne, Xiaoqiang Luo
Speaker normalization is a process in which the short-time features of speech from a given speaker are transformed so as to better match some speaker independent model. Vocal tract length...
An Iterative Algorithm to Build Chinese Language Models (1996)
We present an iterative procedure to build a Chinese language model (LM). We segment Chinese text into words based on a word-based Chinese language model. However, the construction of a Chinese LM...