Jian Hu, Gui-rong Xue, Hua-jun Zeng, Fan-yuan Ma, Ming Zhou
With the increasing popularity of the Internet, research on Cross-Language Information Retrieval (CLIR) is being paid much attention. Existing improving approaches for query translation such as noun...
Exploiting pagerank at different block level (2008)
Xue-mei Jiang, Gui-rong Xue, Wen-guan Song, Hua-jun Zeng, Zheng Chen, Wei-ying Ma
Abstract. In recent years, information retrieval methods focusing on the link analysis have been developed; The PageRank and HITS are two typical ones According to the hierarchical organization of...
Using Probabilistic Latent Semantic Analysis for Personalized Web Search (2008)
Chenxi Lin, Gui-rong Xue, Hua-jun Zeng, Yong Yu
Abstract. Web users use search engine to find useful information on the Internet. However current web search engines return answer to a query independent of specific user information need. Since web...
Gui-rong Xue, Hua-jun Zeng, Zheng Chen, Wei-ying Ma, Hong-jiang Zhang, Chao-jun Lu
Current search engines generally employ link analysis techniques to web-page re-ranking. However, the same techniques are problematic in small webs, such as websites or intranet webs, because the...
LogMiningtoImprovethePerformanceofSiteSearch ∗ (2008)
Gui-rong Xue, Hua-jun Zeng, Zheng Chen, Wei-ying Ma, Chao-jun Lu
In despite of the popularity of current search engines, people still suffer search failure and lots of non-relevant results when finding some specific information from a specific website. This is...
Shanghai Jiao-Tong University (2008)
Gui-rong Xue, Hua-jun Zeng, Zheng Chen, Yong Yu
The performance of web search engines may often deteriorate due to the diversity and noisy information contained within web pages. User click-through data can be used to introduce more accurate...
Similarity Spreading: A Unified Framework for Similarity Calculation of Interrelated Objects (2008)
Gui-rong Xue, Hua-jun Zeng, Zheng Chen, Wei-ying Ma, Yong Yu
In many Web search applications, similarities between objects of one type (say, queries) can be affected by the similarities between their interrelated objects of another type (say, Web pages), and...
Optimizing Web Search using Spreading Activation on the Clickthrough Data (2008)
Gui-rong Xue, Shen Huang, Yong Yu, Hua-jun Zeng, Zheng Chen, Wei-ying Ma
Abstract. In this paper, we propose a mining algorithm to utilize the user clickthrough data to improve search performance. The algorithm first explores the relationship between queries and Web pages...
Jian Hu, Hua-jun Zeng, Hua Li, Cheng Niu, Zheng Chen
Demographic information plays an important role in personalized web applications. However, it is usually not easy to obtain this kind of personal data such as age and gender. In this paper, we made a...
ABSTRACT CWS: A Comparative Web Search System (2008)
Jian-tao Sun, Xuanhui Wang, Dou Shen, Hua-jun Zeng, Zheng Chen
In this paper, we define and study a novel search problem: Comparative Web Search (CWS). The task of CWS is to seek relevant and comparative information from the Web to help users conduct comparisons...
Gui-rong Xue, Yong Yu, Dou Shen, Qiang Yang, Hua-jun Zeng, Zheng Chen
Abstract. Existing categorization algorithms deal with homogeneous Web objects, and consider interrelated objects as additional features when taking the interrelationships with other types of objects...
Mining Clickthrough Data for Collaborative Web Search (2006)
Jian-tao Sun, Xuanhui Wang, Dou Shen, Hua-jun Zeng, Zheng Chen
Scalable collaborative filtering using cluster-based smoothing (2005)
Gui-rong Xue, Chenxi Lin, Qiang Yang, Wensi Xi, Hua-jun Zeng, Yong Yu, ...
Memory-based approaches for collaborative filtering identify the similarity between two users by comparing their ratings on a set of items. In the past, the memory-based approaches have been shown to...
CubeSVD: A Novel Approach to Personalized Web Search (2005)
Jian-tao Sun, Hua-Jun Zeng, Huan Liu, Yuchang Lu
As the competition of Web search market increases, there is a high demand for personalized Web search to conduct retrieval incorporating Web users' information needs. This paper focuses on...
An Experimental Study on Large-Scale Web Categorization Tie-Yan LIU (2005)
Tie-yan Liu, Yiming Yang Hao, Hao Wan, Qian Zhou, Bin Gao, Hua-jun Zeng, ...
Taxonomies of the Web typically have hundreds of thousands of categories and skewed category distribution over documents. It is not clear whether existing text classification technologies can perform...
Support vector machines classification with a very large-scale taxonomy (2005)
Tie-yan Liu, Yiming Yang, Hao Wan, Hua-jun Zeng, Zheng Chen, Wei-ying Ma
Very large-scale classification taxonomies typically have hundreds
Exploiting the hierarchical structure for link analysis (2005)
Gui-rong Xue, Qiang Yang, Hua-jun Zeng, Yong Yu, Zheng Chen
Link analysis algorithms have been extensively used in Web information retrieval. However, current link analysis algorithms generally work on a flat link graph, ignoring the hierarchal structure of...
Exploiting the hierarchical structure for link analysis (2005)
Gui-rong Xue, Qiang Yang, Hua-jun Zeng, Yong Yu, Zheng Chen
Link analysis algorithms have been extensively used in Web information retrieval. However, current link analysis algorithms generally work on a flat link graph, ignoring the hierarchal structure of...
Learning to cluster web search results (2004)
Hua-jun Zeng, Qi-cai He, Zheng Chen, Wei-ying Ma, Jinwen Ma
Organizing Web search results into clusters facilitates users ' quick browsing through search results. Traditional clustering techniques are inadequate since they don't generate clusters...
Learning to cluster web search results (2004)
Hua-jun Zeng, Qi-cai He, Zheng Chen, Wei-ying Ma, Jinwen Ma
Organizing Web search results into clusters facilitates users ' quick browsing through search results. Traditional clustering techniques are inadequate since they don't generate clusters...
Irc: An iterative reinforcement categorization algorithm for interrelated web objects (2004)
Gui-rong Xue, Dou Shen, Qiang Yang, Hua-jun Zeng, Zheng Chen
Most existing categorization algorithms deal with homogeneous Web data objects, and consider interrelated objects as additional features when taking the interrelationships with other types of objects...
CBC: Clustering Based Text Classification (2003)
Requiring Minimal Labeled, Hua-jun Zeng, Xuan-hui Wang, Zheng Chen, Wei-ying Ma
Semi-supervised learning methods construct classifiers using both labeled and unlabeled training data samples. While unlabeled data samples can help to improve the accuracy of trained models to...
Log Mining to Improve the Performance of Site Search (2002)
Gui-rong Xue, Hua-jun Zeng, Zheng Chen, Wei-ying Ma, Chao-jun Lu
Despite of the popularity of global search engines, people still suffer from low accuracy of site search. The primary reason lies in the difference of link structures and data scale between global...