Shanghai Jiao-Tong University, Shanghai, P.R. China (2008)

Abstract
Weblogs have become a prevalent medium through which people share information and express themselves. In general, there are two genres of content in weblogs. The first is about the webloggers' personal feelings, thoughts, or emotions; we call weblogs of this kind affective articles. The second is about technologies and various kinds of informative news. In this paper, we present a machine learning method for classifying weblog articles as informative or affective. We treat this as a binary classification problem. Using machine learning approaches, we achieve about 92% on information retrieval performance measures, including precision, recall, and F1. We set up three studies on applications of this classification approach in both research and industrial fields. First, the approach is used to improve the classification of emotions in weblog articles. Second, we develop an intent-driven weblog search engine based on the classification techniques to improve the satisfaction of Web users. Finally, the approach is applied to search for weblogs that contain a large proportion of informative articles.
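
The abstract reports precision, recall, and F1 for a binary informative/affective classifier. As a minimal sketch of how these three measures are computed for such a task, the following example treats "informative" as the positive class. The function name, label strings, and sample data are all illustrative assumptions and are not taken from the paper.

```python
def precision_recall_f1(gold, predicted, positive="informative"):
    """Compute precision, recall, and F1 for one class of a binary task.

    `gold` and `predicted` are equal-length sequences of labels.
    The label names used here are hypothetical, chosen for illustration.
    """
    pairs = list(zip(gold, predicted))
    # True positives: articles correctly labeled as the positive class.
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    # False positives: articles wrongly labeled as the positive class.
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    # False negatives: positive-class articles the classifier missed.
    fn = sum(1 for g, p in pairs if g == positive and p != positive)

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1


# Made-up labels for five articles (not data from the paper).
gold = ["informative", "informative", "affective", "affective", "informative"]
predicted = ["informative", "affective", "affective", "informative", "informative"]

precision, recall, f1 = precision_recall_f1(gold, predicted)
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```

F1 is the harmonic mean of precision and recall, so it is high only when both are high.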

Publication details
Download http://citeseerx.ist.psu.edu/viewdoc/summary?doi=?doi=10.1.1.120.371
Source http://www.cs.ust.hk/~qyang/Docs/2007/wwwni2007.pdf
Contributors CiteSeerX
Repository CiteSeerX - Scientific Literature Digital Library and Search Engine (United States)
Keywords Categories and Subject Descriptors: H.4.m [Information Systems]: Miscellaneous; I.5.4 [Pattern Recognition]: Applications - Text processing. General Terms: Algorithms, Measurement, Performance, Design, Experimentation, Human Factors. Keywords: Weblog, Informative article, Affective article, Classification, User Intent
Type text
Language English
Relation 10.1.1.117.4939, 10.1.1.32.9956, 10.1.1.21.7950, 10.1.1.19.1992, 10.1.1.3.2480, 10.1.1.1.9196, 10.1.1.13.8572, 10.1.1.30.2615, 10.1.1.118.2654, 10.1.1.2.8851