Publication View

and (2008)

Abstract
A set containment join is a join between set-valued attributes of two relations, whose join condition is specified using the subset (⊆) operator. Set containment joins are deployed in many database applications, even those that do not support set-valued attributes. In this paper, we propose two novel partitioning algorithms, called the Adaptive Pick-and-Sweep Join (APSJ) and the Adaptive Divide-and-Conquer Join (ADCJ), which allow computing set containment joins efficiently. We show that APSJ outperforms previously suggested algorithms for many data sets, often by an order of magnitude. We present a detailed analysis of the algorithms and study their performance on real and synthetic data using an implemented testbed. Categories and Subject Descriptors: H.2.4 [Database Management]: Systems—query processing

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.93.9958
Source http://infolab.stanford.edu/~melnik/pub/melnik_TODS03.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Keywords General Terms, Algorithms, Experimentation, Performance
Type text
Language English
Relation 10.1.1.43.2358, 10.1.1.24.8162, 10.1.1.49.9444, 10.1.1.34.4737, 10.1.1.44.6932, 10.1.1.106.3750, 10.1.1.17.1322, 10.1.1.63.9114, 10.1.1.21.362