| Deep Classification in Large-scale Text Hierarchies (2009) | |||||||||||||||||
Abstract | |||||||||||||||||
| Most classification algorithms are best at categorizing the Web documents into a few categories, such as the top two levels in the Open Directory Project. Such a classification method does not give very detailed topic-related class information for the user because the first two levels are often too coarse. However, classification on a large-scale hierarchy is known to be intractable for many target categories with cross-link relationships among them. In this paper, we propose a novel deep-classification approach to categorize Web documents into categories in a large-scale taxonomy. The approach consists of two stages: a search stage and a classification stage. In the first stage, a category-search algorithm is used to acquire the | |||||||||||||||||
Publication details | |||||||||||||||||
| |||||||||||||||||