Training Linear Discriminant Analysis in Linear Time (2008)
Abstract—Linear Discriminant Analysis (LDA) has been a popular method for extracting features which preserve class separability. It has been widely used in many fields of information processing,...
Neighborhood Preserving Embedding (NPE) and Marginal (2008)
Subspace learning based face recognition methods have attracted considerable interests in recent years, including
Locality sensitive discriminant analysis (2008)
Deng Cai, Jiawei Han, Xiaofei He, Kun Zhou, Hujun Bao
Linear Discriminant Analysis (LDA) is a popular data-analytic tool for studying the class relationship between data points. A major disadvantage of LDA is that it fails to discover the local...
ABSTRACT Topic Modeling with Network Regularization (2008)
Qiaozhu Mei, Deng Cai, Duo Zhang, Chengxiang Zhai
In this paper, we formally define the problem of topic modeling with network structure (TMN). We propose a novel solution to this problem, which regularizes a statistical topic model with a harmonic...
Deng Cai, Shipeng Yu, Ji-rong Wen, Wei-ying Ma
A new web content structure analysis based on visual representation is proposed in this paper. Many web applications such as information retrieval, information extraction and automatic page...
Document Clustering Using Locality Preserving ∗ corresponding author Indexing (2008)
Deng Cai, Xiaofei He, Jiawei Han
1 We propose a novel document clustering method, which aims to cluster the docu-ments into different semantic classes. The document space is generally of high dimen-sionality, and clustering in such...
Fisher Analysis (MFA) and Local Discriminant Embedding (2008)
Deng Cai, Xiaofei He, Yuxiao Hu, Jiawei Han, Thomas Huang
Subspace learning based face recognition methods have attracted considerable interests in recently years, including
ABSTRACT Laplacian Optimal Design for Image Retrieval (2008)
Xiaofei He, Wanli Min, Kun Zhou, Deng Cai
Relevance feedback is a powerful technique to enhance Content-Based Image Retrieval (CBIR) performance. It solicits the user’s relevance judgments on the retrieved images returned by the CBIR...
and Retrieval—Relevance feedback (2008)
Deng Cai, Xiaofei He, Jiawei Han
Recently, there have been considerable interests in geometric-based methods for image retrieval. These methods consider the image space as a smooth manifold and apply manifold learning techniques to...
ABSTRACT Orthogonal Locality Preserving Indexing (2008)
We consider the problem of document indexing and representation. Recently, Locality Preserving Indexing (LPI) was proposed for learning a compact document subspace. Different from Latent Semantic...
ABSTRACT Tensor Space Model for Document Analysis (2008)
Vector Space Model (VSM) has been at the core of information retrieval for the past decades. VSM considers the documents as vectors in high dimensional space. In such a vector space, techniques like...
∗ corresponding author Orthogonal Laplacianfaces for Face Recognition (2008)
Deng Cai, Xiaofei He, Jiawei Han, Acm Fellow
1 Following the intuition that the naturally occurring face data may be generated by sampling a probability distribution that has support on or near a sub-manifold of ambient space, we propose an...
ABSTRACT Image Clustering with Tensor Representation ∗ (2008)
Xiaofei He, Deng Cai, Haifeng Liu, Jiawei Han
We consider the problem of image representation and clustering. Traditionally, an n1 × n2 image is represented by a vector in the Euclidean space R n1×n2. Some learning algorithms are then applied...
(Roweis & Saul, 2000), Laplacian (2008)
Xiaofei He, Deng Cai, Wanli Min
Recently, several manifold learning algorithms have been proposed, such as ISOMAP (Tenenbaum et al., 2000), Locally Linear Embedding
Spectral Regression: A Unified Approach for Sparse Subspace Learning ∗ (2008)
Recently the problem of dimensionality reduction (or, subspace learning) has received a lot of interests in many fields of information processing, including data mining, information retrieval, and...
Deng Cai, Xiaofei He, Jiawei Han, Deng Cai, Xiaofei He, Jiawei Han
Recently the problem of dimensionality reduction has received a lot of interests in many fields of information processing, including data mining, information retrieval, and pattern recognition. We...
Efficient kernel discriminant analysis via spectral regression (2007)
Linear Discriminant Analysis (LDA) has been a popular method for extracting features which preserve class separability. The projection vectors are commonly obtained by maximizing the between class...
H.: Locality sensitive discriminant analysis (2007)
Linear Discriminant Analysis (LDA) is a popular data-analytic tool for studying the class relationship between data points. A major disadvantage of LDA is that it fails to discover the local...
Regularized locality preserving indexing via spectral regression (2007)
Deng Cai, Xiaofei He, Wei Vivian Zhang, Jiawei Han
We consider the problem of document indexing and representation. Recently, Locality Preserving Indexing (LPI) was proposed for learning a compact document subspace. Different from Latent Semantic...
Spectral regression: A unified subspace learning framework for content-based image retrieval (2007)
Relevance feedback is a well established and effective framework for narrowing down the gap between low-level visual features and high-level semantic concepts in content-based image retrieval. In...
Semi-Supervised Regression using Spectral Techniques ∗ (2006)
Deng Cai, Xiaofei He, Jiawei Han, Deng Cai, Xiaofei He, Jiawei Han
Graph-based approaches for semi-supervised learning have received increasing amount of interest in recent years. Despite their good performance, many pure graph based algorithms do not have explicit...
Tensor space model for document analysis (2006)
Deng Cai, Deng Cai, Xiaofei He, Xiaofei He, Jiawei Han, Jiawei Han
Vector Space Model (VSM) has been at the core of information retrieval for the past decades. VSM considers the documents as vectors in high dimensional space. In such a vector space, tech-niques like...
Mining hidden community in heterogeneous social networks (2005)
Deng Cai, Zheng Shao, Xiaofei He, Xifeng Yan, Jiawei Han
Social network analysis has attracted much attention in recent years. Community mining is one of the major directions in social network analysis. Most of the existing methods on community mining...
Mining hidden community in heterogeneous social networks (2005)
Deng Cai, Zheng Shao, Xiaofei He, Xifeng Yan, Jiawei Han, Deng Cai, ...
Social network analysis has attracted much attention in recent years. Community mining is one of the major directions in social network analysis. Most of the existing methods on community mining...
Neighborhood preserving embedding (2005)
Xiaofei He, Deng Cai, Shuicheng Yan, Hong-jiang Zhang
Recently there has been a lot of interest in geometrically motivated approaches to data analysis in high dimensional spaces. We consider the case where data is drawn from sampling a probability...
Laplacian score for feature selection (2005)
Xiaofei He, Deng Cai, Partha Niyogi
In supervised learning scenarios, feature selection has been studied widely in the literature. Selecting features in unsupervised learning scenarios is a much harder problem, due to the absence of...
Tensor subspace analysis (2005)
Xiaofei He, Deng Cai, Partha Niyogi
Previous work has demonstrated that the image variations of many objects (human faces in particular) under variable lighting can be effectively modeled by low dimensional linear spaces. The typical...
Community mining from multi-relational networks (2005)
Deng Cai, Zheng Shao, Xiaofei He, Xifeng Yan, Jiawei Han
Abstract. Social network analysis has attracted much attention in recent years. Community mining is one of the major directions in social network analysis. Most of the existing methods on community...
Locality Preserving Indexing for Document Representation (2004)
Document representation and indexing is a key problem for document analysis and processing, such as clustering, classification and retrieval. Conventionally, Latent Semantic Indexing (LSI) is...
Deng Cai, Shipeng Yu, Ji-rong Wen, Wei-ying Ma
Multiple-topic and varying-length of web pages are two negative factors significantly affecting the performance of web search. In this paper, we explore the use of page segmentation algorithms to...
Organizing WWW Images Based on The Analysis of Page Layout and Web Link Structure (2004)
Deng Cai, Xiaofei He, Wei-ying Ma, Ji-rong Wen, Hongjiang Zhang
Due to the rapid growth of the number of digital images on the Web, there is an increasing demand for effective and efficient method for organizing and retrieving the images available. This paper...
Block-level Link Analysis (2004)
Deng Cai, Xiaofei He, Ji-rong Wen, Wei-ying Ma
Link Analysis has shown great potential in improving the performance of web search. PageRank and HITS are two of the most popular algorithms. Most of the existing link analysis algorithms treat a web...
Deng Cai, Shipeng Yu, Ji-rong Wen, Wei-ying Ma
Multiple-topic and varying-length of web pages are two negative factors significantly affecting the performance of web search. In this paper, we explore the use of page segmentation algorithms to...
Organizing WWW Images Based on The Analysis of Page Layout and Web Link Structure (2004)
Deng Cai, Xiaofei He, Wei-ying Ma, Ji-rong Wen, Hongjiang Zhang
Due to the rapid growth of the number of digital images on the Web, there is an increasing demand for effective and efficient method for organizing and retrieving the images available. This paper...
Locality preserving clustering for image database (2004)
Xin Zheng, Deng Cai, Xiaofei He, Wei-ying Ma, Xueyin Lin
It is important and challenging to make the growing image repositories easy to search and browse. Image clustering is a technique that helps in several ways, including image data preprocessing, user...
Locality Preserving Indexing for Document Representation (2004)
Document representation and indexing is a key problem for document analysis and processing, such as clustering, classification and retrieval. Conventionally, Latent Semantic Indexing (LSI) is...
Deng Cai, Xiaofei He, Zhiwei Li, Wei-ying Ma, Ji-rong Wen
We consider the problem of clustering Web image search results. Generally, the image search results returned by an image search engine contain multiple topics. Organizing the results into different...
Xiaofei He, Deng Cai, Ji-rong Wen, Wei-ying Ma, Hong-jiang Zhang
Due to the rapid growth of the number of digital images on the Web, there is an increasing demand for effective and efficient method for organizing and retrieving the images available. This paper...
Block-level Link Analysis (2004)
Deng Cai, Xiaofei He, Ji-rong Wen, Wei-ying Ma
Link Analysis has shown great potential in improving the performance of web search. PageRank and HITS are two of the most popular algorithms. Most of the existing link analysis algorithms treat a web...
Deng Cai, Xiaofei He, Zhiwei Li, Wei-ying Ma, Ji-rong Wen
We consider the problem of clustering Web image search results. Generally, the image search results returned by an image search engine contain multiple topics. Organizing the results into different...
ImageSeer: Clustering and Searching WWW Images (2004)
Xiaofei He, Deng Cai, Ji-rong Wen, Wei-ying Ma, Hong-jiang Zhang
Due to the rapid growth of the number of digital images on the Web, there is an increasing demand for effective and efficient method for organizing and retrieving the images available. This paper...
Improving Pseudo-Relevance Feedback in Web Information Retrieval Using Web Page Segmentation (2003)
Yu, Shipeng, Cai, Deng, Wen, Ji-Rong, Ma, Wei-Ying
In contrast to traditional document retrieval, a web page as a whole is not a good information unit to search because it often contains multiple topics and a lot of irrelevant information from...
Improving pseudo-relevance feedback in web information retrieval using web page segmentation (2003)
Shipeng Yu, Deng Cai, Ji-rong Wen, Wei-ying Ma
In contrast to traditional document retrieval, a web page as a whole is not a good information unit to search because it often contains multiple topics and a lot of irrelevant information from...
1 VIPS: a Vision-based Page Segmentation Algorithm (2003)
Deng Cai, Shipeng Yu, Ji-rong Wen, Wei-ying Ma, Deng Cai, Shipeng Yu, ...
A new web content structure analysis based on visual representation is proposed in this paper. Many web applications such as information retrieval, information extraction and automatic page...
Improving pseudo-relevance feedback in web information retrieval using web page segmentation (2003)
Shipeng Yu, Deng Cai, Ji-rong Wen, Wei-ying Ma
In contrast to traditional document retrieval, a web page as a whole is not a good information unit to search because it often contains multiple topics and a lot of irrelevant information from...
Extracting content structure for web pages based on visual representation (2003)
Deng Cai, Shipeng Yu, Ji-rong Wen, Wei-ying Ma
Abstract. A new web content structure based on visual representation is proposed in this paper. Many web applications such as information retrieval, information extraction and automatic page...