Gui-rong Xue

Publication List Details

Period

2002 - 2009

Number

31

Co-Authors

Self-taught Clustering (2009)

Wenyuan Dai, Qiang Yang, Gui-rong Xue, Yong Yu

This paper focuses on a new clustering task, called self-taught clustering. Self-taught clustering is an instance of unsupervised transfer learning, which aims at clustering a small collection of...

ABSTRACT Spectral Domain-Transfer Learning (2009)

Xiao Ling, Wenyuan Dai, Gui-rong Xue, Qiang Yang, Yong Yu

Traditional spectral classification has been proved to be effective in dealing with both labeled and unlabeled data when these data are from the same domain. In many real world applications, however,...

Deep Classifier: Automatically Categorizing Search Results into Large-Scale Hierarchies ABSTRACT (2009)

Dikan Xing, Gui-rong Xue, Qiang Yang, Yong Yu

Organizing Web search results into hierarchical categories facilitates users ’ browsing through Web search results, especially for ambiguous queries where the potential results are mixed together....

Deep Classification in Large-scale Text Hierarchies (2009)

Gui-rong Xue, Dikan Xing, Qiang Yang, Yong Yu

Most classification algorithms are best at categorizing the Web documents into a few categories, such as the top two levels in the Open Directory Project. Such a classification method does not give...

Topic-bridged PLSA for Cross-Domain Text Classification (2009)

Gui-rong Xue, Wenyuan Dai, Qiang Yang, Yong Yu

In many Web applications, such as blog classification and newsgroup classification, labeled data are in short supply. It often happens that obtaining labeled data in a new domain is expensive and...

Improving Query Translation for Cross-Language Information Retrieval using a Web-based Approach (2008)

Jian Hu, Gui-rong Xue, Hua-jun Zeng, Fan-yuan Ma, Ming Zhou

With the increasing popularity of the Internet, research on Cross-Language Information Retrieval (CLIR) is being paid much attention. Existing improving approaches for query translation such as noun...

Shanghai Jiao-Tong University Shanghai, P.R.China (2008)

Xiaochuan Ni, Gui-rong Xue, Xiao Ling, Yong Yu, Qiang Yang

Weblogs have become a prevalent source of information for people to express themselves. In general, there are two genres of contents in weblogs. The first kind is about the webloggers’ personal...

Research Track Paper Co-clustering based Classification for Out-of-domain Documents ABSTRACT (2008)

Wenyuan Dai, Gui-rong Xue, Qiang Yang, Yong Yu

In many real world applications, labeled data are in short supply. It often happens that obtaining labeled data in a new domain is expensive and time consuming, while there may be plenty of labeled...

Exploiting pagerank at different block level (2008)

Xue-mei Jiang, Gui-rong Xue, Wen-guan Song, Hua-jun Zeng, Zheng Chen, Wei-ying Ma

Abstract. In recent years, information retrieval methods focusing on the link analysis have been developed; The PageRank and HITS are two typical ones According to the hierarchical organization of...

Using Probabilistic Latent Semantic Analysis for Personalized Web Search (2008)

Chenxi Lin, Gui-rong Xue, Hua-jun Zeng, Yong Yu

Abstract. Web users use search engine to find useful information on the Internet. However current web search engines return answer to a query independent of specific user information need. Since web...

General Terms (2008)

Gui-rong Xue, Hua-jun Zeng, Zheng Chen, Wei-ying Ma, Hong-jiang Zhang, Chao-jun Lu

Current search engines generally employ link analysis techniques to web-page re-ranking. However, the same techniques are problematic in small webs, such as websites or intranet webs, because the...

LogMiningtoImprovethePerformanceofSiteSearch ∗ (2008)

Gui-rong Xue, Hua-jun Zeng, Zheng Chen, Wei-ying Ma, Chao-jun Lu

In despite of the popularity of current search engines, people still suffer search failure and lots of non-relevant results when finding some specific information from a specific website. This is...

Shanghai Jiao-Tong University (2008)

Gui-rong Xue, Hua-jun Zeng, Zheng Chen, Yong Yu

The performance of web search engines may often deteriorate due to the diversity and noisy information contained within web pages. User click-through data can be used to introduce more accurate...

Similarity Spreading: A Unified Framework for Similarity Calculation of Interrelated Objects (2008)

Gui-rong Xue, Hua-jun Zeng, Zheng Chen, Wei-ying Ma, Yong Yu

In many Web search applications, similarities between objects of one type (say, queries) can be affected by the similarities between their interrelated objects of another type (say, Web pages), and...

Optimizing Web Search using Spreading Activation on the Clickthrough Data (2008)

Gui-rong Xue, Shen Huang, Yong Yu, Hua-jun Zeng, Zheng Chen, Wei-ying Ma

Abstract. In this paper, we propose a mining algorithm to utilize the user clickthrough data to improve search performance. The algorithm first explores the relationship between queries and Web pages...

Shanghai Jiao-Tong University Shanghai, P.R.China (2008)

Xiaochuan Ni, Gui-rong Xue, Xiao Ling, Yong Yu, Qiang Yang

Weblogs have become a prevalent source of information for people to express themselves. In general, there are two genres of contents in weblogs. The first kind is about the webloggers’ personal...

DHT Based Searching Improved by Sliding Window (2008)

Shen Huang, Gui-rong Xue, Xing Zhu, Yan-feng Ge, Yong Yu

Abstract. Efficient full-text searching is a big challenge in Peer-to-Peer (P2P) system. Recently, Distributed Hash Table (DHT) becomes one of the reliable communication schemes for P2P. Some...

Puzzle J in Abstract (2008)

Xin-jing Wang, Wei-ying Ma, Gui-rong Xue, Xing Li

In this paper, we propose an iterative similarity propagation approach to explore the inter-relationships between Web images and their textual annotations for image retrieval. By considering Web...

Can chinese web pages be classified with english data source (2008)

Xiao Ling, Gui-rong Xue, Wenyuan Dai, Yun Jiang, Qiang Yang, Yong Yu

As the World Wide Web in China grows rapidly, mining knowledge in Chinese Web pages becomes more and more important. Mining Web information usually relies on the machine learning techniques which...

Transferring naive bayes classifiers for text classification (2007)

Wenyuan Dai, Gui-rong Xue, Qiang Yang, Yong Yu

A basic assumption in traditional machine learning is that the training and test data distributions should be identical. This assumption may not hold in many situations in practice, but we may be...

Boosting for transfer learning (2007)

Wenyuan Dai, Qiang Yang, Gui-rong Xue, Yong Yu

Traditional machine learning makes a basic assumption: the training and test data should be under the same distribution. However, in many cases, this identicaldistribution assumption does not hold....

Reinforcing web-object categorization through interrelationships. Data Mining and Knowledge Discovery (2006)

Gui-rong Xue, Yong Yu, Dou Shen, Qiang Yang, Hua-jun Zeng, Zheng Chen

Abstract. Existing categorization algorithms deal with homogeneous Web objects, and consider interrelated objects as additional features when taking the interrelationships with other types of objects...

Reinforcing Web-object Categorization through Interrelationships (2006)

Gui-Rong Xue, Yong Yu, Dou Shen, Qiang Yang, Zheng Chen

Existing categorization algorithms deal with homogeneous Web objects, and consider interrelated objects as additional features when taking the interrelationships with other types of objects into...

Scalable collaborative filtering using cluster-based smoothing (2005)

Gui-rong Xue, Chenxi Lin, Qiang Yang, Wensi Xi, Hua-jun Zeng, Yong Yu, ...

Memory-based approaches for collaborative filtering identify the similarity between two users by comparing their ratings on a set of items. In the past, the memory-based approaches have been shown to...

Exploiting the hierarchical structure for link analysis (2005)

Gui-rong Xue, Qiang Yang, Hua-jun Zeng, Yong Yu, Zheng Chen

Link analysis algorithms have been extensively used in Web information retrieval. However, current link analysis algorithms generally work on a flat link graph, ignoring the hierarchal structure of...

Exploiting the hierarchical structure for link analysis (2005)

Gui-rong Xue, Qiang Yang, Hua-jun Zeng, Yong Yu, Zheng Chen

Link analysis algorithms have been extensively used in Web information retrieval. However, current link analysis algorithms generally work on a flat link graph, ignoring the hierarchal structure of...

Multi-Type Features based Web Document Clustering (2004)

Shen Huang, Gui-rong Xue, Ben-yu Zhang, Zheng Chen, Yong Yu, Wei-ying Ma

Abstract. Clustering has been demonstrated as a feasible way to explore the contents of document collection and organize search engine results. For this task, many features of Web page, such as...

TSSP: A Reinforcement Algorithm to Find Related Papers (2004)

Shen Huang, Gui-rong Xue, Ben-yu Zhang, Zheng Chen, Yong Yu, Wei-ying Ma

Content analysis and citation analysis are two common methods in recommending system. Compared with content analysis, citation analysis can discover more implicitly related papers. However, the...

Irc: An iterative reinforcement categorization algorithm for interrelated web objects (2004)

Gui-rong Xue, Dou Shen, Qiang Yang, Hua-jun Zeng, Zheng Chen

Most existing categorization algorithms deal with homogeneous Web data objects, and consider interrelated objects as additional features when taking the interrelationships with other types of objects...

Log Mining to Improve the Performance of Site Search (2002)

Gui-rong Xue, Hua-jun Zeng, Zheng Chen, Wei-ying Ma, Chao-jun Lu

Despite of the popularity of global search engines, people still suffer from low accuracy of site search. The primary reason lies in the difference of link structures and data scale between global...