Publication View

Implementing the context tree weighting method for context recognition (2007)

Abstract
The context tree weighting method (CTW) is a competent statistics–based universal data com-pression algorithm proposed by Willems et al. in 1995. Motivated by the superior data compression performance of CTW over Lempel–Ziv based algorithms, we thoroughly investigate in this work the usability of CTW to applications involving content recognition and classification by means of data compression. Given a test file that needs to be classified into one of several reference files, the reference file which leads to the best compression of the test file, when both files are appended, is selected as the most probable match. Moreover, based on its algorithmic structure, we propose the concept of context tree freezing to improve the implementation of CTW for content recognition. We demonstrate the superiority of our implementation of CTW with freezing by comparing its performance with a wide range of data compression algorithms for content recognition problems such as language recognition, authorship attribution, and even DNA data classification. I.

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=?doi=10.1.1.3.5079
Source http://www.lnt.ei.tum.de/mitarbeiter/dawy/download/ZaherDawy_DCC04.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Type text
Language English
Relation 10.1.1.1.3346, 10.1.1.30.1819, 10.1.1.109.3733, 10.1.1.50.9317, 10.1.1.26.1074, 10.1.1.124.1232