Publication View

Top-k Query Processing in Uncertain Databases (2007)

Abstract
Top-k processing in uncertain databases is semantically and computationally different from traditional top-k processing. The interplay between score and uncertainty information makes traditional top-k processing techniques inapplicable to uncertain databases. In this paper we introduce new probabilistic formulations for top-k queries. Our formulations are based on marriage of traditional top-k semantics with possible worlds semantics. In the light of these formulations, we construct a framework that encapsulates a novel probabilistic model and efficient query processing techniques to tackle the challenges raised by uncertain data settings. We prove that our techniques minimize the number of accessed tuples, and the number of materialized possible query answers. Our experiments show the efficiency of our techniques under different data distributions with orders of magnitude improvement over naïve materialization of possible worlds. 1

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.93.7168
Source http://www.cs.uwaterloo.ca/research/tr/2006/CS-2006-25.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Type text
Language English
Relation 10.1.1.101.3165, 10.1.1.60.5590, 10.1.1.118.2067, 10.1.1.53.293, 10.1.1.46.6676, 10.1.1.108.6943, 10.1.1.2.3045, 10.1.1.119.3771, 10.1.1.32.9542, 10.1.1.11.2693, 10.1.1.138.1157, 10.1.1.103.645, 10.1.1.125.2419, 10.1.1.88.5501, 10.1.1.136.558, 10.1.1.79.2589, 10.1.1.125.6494, 10.1.1.71.1644, 10.1.1.131.212, 10.1.1.113.5449