Expressiveness and performance of full-text search languages (2006)

Abstract
We study the expressiveness and performance of full-text search languages. Our motivation is to provide a formal basis for comparing full-text search languages and to develop a model for full-text search that can be tightly integrated with structured search. We design a model based on the positions of tokens (words) in the input text, and develop a full-text calculus (FTC) and a full-text algebra (FTA) with equivalent expressive power; this suggests a notion of completeness for full-text search languages. We show that existing full-text languages are incomplete and identify a practical subset of the FTC and FTA that is more powerful than existing languages, but which can still be evaluated efficiently.
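To make the idea of a position-based model concrete, here is a minimal sketch in Python. It is an illustration only, not the paper's FTC or FTA: the function names (position_index, within_distance), the tokenization rule, and the gap-based distance predicate are all assumptions chosen for this example. It shows how a query can be phrased as a condition quantified over token positions.

```python
import re
from collections import defaultdict


def position_index(text):
    """Map each lowercased token to the list of positions where it occurs.

    Tokenization (runs of word characters) is an assumption of this sketch.
    """
    index = defaultdict(list)
    for pos, token in enumerate(re.findall(r"\w+", text.lower())):
        index[token].append(pos)
    return index


def within_distance(index, first, second, max_gap):
    """Hypothetical predicate: there exist two distinct positions p1, p2
    holding `first` and `second` with at most `max_gap` tokens between them."""
    return any(
        p1 != p2 and abs(p1 - p2) - 1 <= max_gap
        for p1 in index.get(first, [])
        for p2 in index.get(second, [])
    )


doc = "full-text search can be tightly integrated with structured search"
idx = position_index(doc)

adjacent = within_distance(idx, "structured", "search", 0)
too_far = within_distance(idx, "full", "search", 0)
near = within_distance(idx, "full", "search", 1)
```

Because the predicate ranges over pairs of positions rather than over whole documents, conditions such as adjacency, ordering, or windows can be layered on top of the same index, which is the kind of expressiveness a position-based model is meant to capture.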

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.114.4182
Source http://www.cs.cornell.edu/~cbotev/expressiveness.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Keywords as a disjunction or conjunction of query
Type text
Language English
Relation 10.1.1.87.9634, 10.1.1.112.869, 10.1.1.12.1394, 10.1.1.56.5928, 10.1.1.11.9191, 10.1.1.21.2920, 10.1.1.15.2825, 10.1.1.133.3445, 10.1.1.12.902, 10.1.1.8.3210, 10.1.1.32.7182, 10.1.1.124.652