Fast and exact out-of-core and distributed k-means clustering (2008)
Ruoming Jin, Anjan Goswami, Gagan Agrawal
Clustering has been one of the most widely studied topics in data mining and k-means clustering has been one of the popular clustering algorithms. K-means requires several passes on the entire...
Accurate One Pass Decision Tree Construction (2008)
Ruoming Jin, Anjan Goswami, Gagan Agrawal
In mining continuous data streams, one popular paradigm is using sampling and having a one-pass algorithm with a probabilistic bound on the accuracy. A key application of this approach is in decision...
Efficient clustering algorithms for out of core, distributed and streaming data (2005)
Thesis (M.S.)--Ohio State University, 2005.
Fast and exact out-of-core k-means clustering (2004)
Clustering has been one of the most widely studied topics in data mining and k-means clustering has been one of the popular clustering algorithms. K-means requires several passes on the entire...
Clustering has been one of the most widely studied topics in data mining and it is often the first step of data mining process. Today, we are witnessing enormous growth in data volume. Often, data is...