Assessing Quality of Product Reviews (2008)
In the past few years, there has been an increasing interest in mining opinions from product reviews [3][4][5]. However, due to the lack of editorial and quality control, reviews on products vary...
Mining Latent Associations of Objects Using a Typed Mixture Model (2008)
Shenghua Bao, Yunbo Cao, Bing Liu, Yong Yu, Hang Li
This paper studies the problem of discovering latent associations among objects in text documents. Specifically, given two sets of objects and various types of co-occurrence data concerning the...
A Supervised Learning Approach to Entity Search (2008)
Guoping Hu, Jingjing Liu, Hang Li, Yunbo Cao, Jian-yun Nie, Jianfeng Gao
In this paper we address the problem of entity search. Expert search and time search are used as...
Web Page Title Extraction and Its Application (2008)
Yewei Xue, Yunhua Hu, Guomao Xin, Ruihua Song, Shuming Shi, Yunbo Cao, ...
This paper is concerned with automatic extraction of titles from the bodies of HTML documents (web pages). Titles of HTML documents should be correctly defined in the title fields by the authors;...
IEEE INTELLIGENT SYSTEMS, Published by the IEEE Computer Society (2008)
Using Bilingual Web, Hang Li, Yunbo Cao, Cong Li
ry consultation for words and phrases through two basic features: mouse hovering and searching. When a user puts the cursor on a word such as cellular, ERW displays the word and its translations in a...
Searching Documents Based on Relevance and Type (2007)
Jun Xu, Yunbo Cao, Hang Li, Nick Craswell, Yalou Huang
This paper extends previous work on document retrieval and document type classification, addressing the problem of ‘typed search’. Specifically, given a query and a designated document...
Adapting ranking SVM to document retrieval (2006)
Yunbo Cao, Jun Xu, Tie-yan Liu, Hang Li, Yalou Huang, Hsiao-wuen Hon
The paper is concerned with applying learning to rank to document retrieval. Ranking SVM is a typical method of learning to rank. We point out that there are two factors one must consider when...
Cost-sensitive learning of SVM for ranking (2006)
Jun Xu, Yunbo Cao, Hang Li, Yalou Huang
In this paper, we propose a new method for learning to rank. ‘Ranking SVM’ is a method for performing the task. It formulizes the problem as that of binary classification on instance...
Ranking Definitions with Supervised Learning Methods (2005)
This paper is concerned with the problem of definition search. Specifically, given a term, we are to retrieve definitional excerpts of the term and rank the extracted excerpts according to their...
Title Extraction from Bodies of HTML Documents and its Application to Web Page Retrieval (2005)
Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Shuming Shi, Yunbo Cao, ...
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, in reality HTML...
Uncertainty Reduction in Collaborative Bootstrapping: Measure and Algorithm (2003)
Yunbo Cao
This paper proposes the use of uncertainty reduction in machine learning methods such as co-training and bilingual bootstrapping, which are referred to, in a general term, as ‘collaborative...
Base noun phrase translation using web data and the EM algorithm (2002)
We consider here the problem of Base Noun Phrase translation. We propose a new method to perform the task. For a given Base NP, we first search its translation candidates from the web. We next...
The traditional problem that exists from the time of Tower of Babel becomes more serious in the internet era. That is how we read and write foreign languages. Figure 1 shows an estimate of the...