THUIR at TREC 2005 Terabyte Track (2008)
Le Zhao, Rongwei Ceng, Min Zhang, Shaoping Ma
Abstract: This year the IR group of Tsinghua University used its TMiner text retrieval system for indexing and retrieval in the Terabyte track ad hoc and named-page subtasks. In doing the two tasks,...
Zhong Su, Qiang Yang, Hongjiang Zhang, Xiaowei Xu, Yu-hen Hu, Shaoping Ma
Abstract. A great challenge for web site designers is how to ensure users' easy and efficient access to important web pages. In this paper we present a clustering-based approach to address this...
Data Cleansing for Web Information Retrieval using Query Independent Features (2008)
Yiqun Liu, Min Zhang, Liyun Ru, Shaoping Ma
We report on a study that was undertaken to better understand what kinds of Web pages are the most useful for web search engine users by exploiting query-independent features of retrieval target...
Web Page Quality Estimation Based on Linear Discriminant Function (2008)
Rongwei Cen, Yiqun Liu, Min Zhang, Liyun Ru, Shaoping Ma
With the growth of web data, how to estimate web page quality effectively and rapidly becomes more and more important for web information retrieval and knowledge discovery. This paper analyzes the...
THUIR in TREC2003: HARD Experiments (2008)
Liang Ma, Wei Tan, Qunxiu Chen, Shaoping Ma, Shuicai Shi, Shibin Xiao, ...
In this paper, we describe the ideas and related experiments of the Tsinghua University IR group in the TREC-12 HARD Track. In this track, we focus on an automatic delivering mechanism, which combines the...
Yijiang Jin, Wei Qi, Min Zhang, Shaoping Ma
This year, the Tsinghua University Information Retrieval Group (THUIR) participated in the terabyte track of TREC for the first time. Since the document collection is about 426 GB and we do not...
Wei Tan, Qunxiu Chen, Shaoping Ma
In this paper, we describe the ideas and related experiments of the Tsinghua University IR group in the TREC 2004 QA track. In this track, our system consists of three components: Question analysis, Information...
Yiqun Liu, Canhui Wang, Min Zhang, Shaoping Ma
In this year's TREC Web Track research, THUIR participated in the Mixed-query Task. This task involves a single query set comprising 3 kinds of queries (Homepage Finding, Named Page Finding and...
Yiqun Liu, Min Zhang, Rongwei Cen, Liyun Ru, Shaoping Ma
Understanding what kinds of Web pages are the most useful for Web search engine users is a critical task in Web information retrieval (IR). Most previous works used hyperlink analysis algorithms to...
The Nature of Novelty Detection (2006)
Le Zhao, Min Zhang, Shaoping Ma
Sentence level novelty detection aims at spotting sentences with novel information from an ordered sentence list. In the task, sentences appearing later in the list with no new meanings are...
The Nature of Novelty Detection (2005)
Le Zhao, Min Zhang, Shaoping Ma
Sentence level novelty detection aims at reducing redundant sentences from a sentence list. In the task, sentences appearing later in the list with no new meanings are eliminated. Aiming at a better...
THUIR at TREC 2005: Enterprise Track (2005)
Yupeng Fu, Wei Yu, Yize Li, Yiqun Liu, Min Zhang, Shaoping Ma
The IR group of Tsinghua University participated in the expert finding task of the TREC 2005 Enterprise track this year. We developed a novel method called document reorganization to solve the...
Segmentation of connected Chinese characters based on genetic algorithm (2005)
Xianghui Wei, Shaoping Ma, Yijiang Jin
The accuracy of segmenting Chinese characters, especially connected Chinese characters, is essential for the performance of a Chinese character recognition system. In this paper, a new approach for...
Improved Feature Selection and Redundance Computing (2004)
Liyun Ru, Le Zhao, Min Zhang, Shaoping Ma
in Novelty task of TREC. Our research on this year’s novelty track mainly focused on four aspects: (1) text feature selection and reduction; (2) improved sentence classification in finding relevant...
THUIR at TREC 2003: Novelty, robust and web (2003)
Min Zhang, Chuan Lin, Yiqun Liu, Le Zhao, Shaoping Ma
describing in following sections, respectively. A new IR system named TMiner has been built on which all experiments have been performed. In the system, Primary Feature Model (PFM) [1] has been...
THU TREC 2002: Web Track Experiments (2002)
Min Zhang, Ruihua Song, Chuan Lin, Shaoping Ma, Zhe Jiang, Yijiang Jin, ...
Anchor text has been proven efficient in former TREC experiments on the homepage finding task [1] and somewhat useful to ad hoc retrieval by result combination [2]. This year, our conclusion was...
Incremental learning for profile training in adaptive document filtering, in TREC11 (2002)
Liang Ma, Qunxiu Chen, Shaoping Ma, Min Zhang, Lianhong Cai
In this paper, we describe our ideas and related experiments in TREC-11 Adaptive Filtering Track. In the track we focused much on a robust way for effective profile training. We developed an...
Min Zhang, Ruihua Song, Chuan Lin, Shaoping Ma, Zhe Jiang, Yijiang Jin, ...
This is the first time that Tsinghua University took part in TREC. In this year's novelty track, our basic idea is to find the key factor that helps people find relevant and new information on a...
Using Bayesian classifier in relevant feedback of image retrieval (2000)
Zhong Su, Hongjiang Zhang, Shaoping Ma
Relevance feedback is a powerful technique in content-based image retrieval (CBIR) and has been an active research area for the past few years. In this paper, we propose a new relevance feedback...