Publication View

Abstract (2008)

Abstract
We study the expressiveness and performance of full-text search languages. Our main motivation is to provide a formal basis for comparing such languages and to develop a model for full-text search that can be tightly integrated with structured search. We develop a formal model for full-text search based on the positions of tokens (words) in the input text, and develop a full-text calculus (FTC) and a full-text algebra (FTA) with equivalent expressive power. This suggests a notion of completeness for full-text search languages and can be used as a basis for a study of their expressiveness. We show that existing full-text languages are incomplete and developCOMP, a complete full-text search language. We also identify practical subsets ofCOMP that are more powerful than existing languages, develop efficient query evaluation algorithms for these subsets, and study experimentally their performance. 1

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.86.4595
Source http://www.cs.cornell.edu/~cbotev/completeness-tr.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Type text
Language English
Relation 10.1.1.27.7690, 10.1.1.20.5942, 10.1.1.12.1394, 10.1.1.102.4741, 10.1.1.11.9191, 10.1.1.12.2124, 10.1.1.15.2825, 10.1.1.133.3445, 10.1.1.133.2764, 10.1.1.12.902