Using Error-Correcting Output Codes with Model-Refinement to Boost Centroid Text Classifier (2009)
In this work, we investigate the use of error-correcting output codes (ECOC) for boosting centroid text classifier. The implementation framework is to decompose one multi-class problem into multiple...
Combining Language Model with Sentiment Analysis for Opinion Retrieval of Blog-Post (2008)
Xiangwen Liao, Donglin Cao, Songbo Tan, Yue Liu, Guodong Ding, Xueqi Cheng
This paper describes our participation in Blog Opinion Retrieval task this year. We conduct experiments on “FirteX ” platform that is developed by our lab. Language Model is used to retrieve...
An Effective Approach to Enhance Centroid Classifier for Text Categorization (2008)
Abstract. Centroid Classifier has been shown to be a simple and yet effective method for text categorization. However, it is often plagued with model misfit (or inductive bias) incurred by its...
AN EFFICIENT GLOBAL OPTIMIZATION APPROACH FOR ROUGH SET BASED DIMENSIONALITY REDUCTION (2006)
Songbo Tan, Xueqi Cheng, Hongbo Xu
Abstract. The theory of rough set, proposed by Pawlak, provides a formal tool for knowledge discovery from imprecise and incomplete data. Dimensionality reduction (RED) is a crucial problem for rough...
A novel refinement approach for text categorization (2005)
Songbo Tan, Xueqi Cheng, Moustafa M. Ghanem, Bin Wang, Hongbo Xu
In this paper we present a novel strategy, DragPushing, for improving the performance of text classifiers. The strategy is generic and takes advantage of training errors to successively refine the...
Efficient algorithms for attributes reduction problem (2005)
Abstract. The theory of rough set, proposed by Pawlak, provides a formal tool for knowledge discovery from imprecise and incomplete data. Attributes reduction is a crucial problem for rough set based...