Wessel Kraaij

Search and Retrieval General Terms (2009)

Edgar Meij, Dolf Trieschnigg, Maarten De Rijke, Wessel Kraaij

In many collections, documents are annotated using concepts from a structured knowledge source such as an ontology or thesaurus. Examples include the news domain [7], where each news item is...

PAPER SIGIR’s 30th anniversary: an analysis of trends in IRresearch and the topology of its community (2009)

Djoerd Hiemstra, Claudia Hauff, Franciska De Jong, Wessel Kraaij

This paper presents an analysis of all SIGIR proceedings to date in order to summarize what IR researchers discussed over the years, where they are from, and whether subcommunities can be identified,...

MeSH Up: effective MeSH text classification for improved document retrieval (2009)

Trieschnigg, Dolf, Pezik, Piotr, Lee, Vivian, Jong De, Franciska, Kraaij, Wessel, Rebholz-Schuhmann, Dietrich

Motivation: Controlled vocabularies such as the Medical Subject Headings (MeSH) thesaurus and the Gene Ontology (GO) provide an efficient way of accessing and organizing biomedical information by...

High-level feature detection from video in TRECVid: a 5-year retrospective of achievements (2009)

Smeaton, Alan F., Over, Paul, Kraaij, Wessel

Successful and effective content-based access to digital video requires fast, accurate and scalable methods to determine the video content automatically. A variety of contemporary approaches to this...

High-level feature detection from video in TRECVid: a 5-year retrospective of achievements (2009)

Smeaton, Alan F., Over, Paul, Kraaij, Wessel

Successful and effective content-based access to digital video requires fast, accurate and scalable methods to determine the video content automatically. A variety of contemporary approaches to this...

MeSH Up: effective MeSH text classification for improved document retrieval (2009)

Trieschnigg, Dolf, Pezik, Piotr, Lee, Vivian, De Jong, Franciska, Kraaij, Wessel, Rebholz-Schuhmann, Dietrich

Motivation: Controlled vocabularies such as the Medical Subject Headings (MeSH) thesaurus and the Gene Ontology (GO) provide an efficient way of accessing and organizing biomedical information by...

Conceptual language models for domain-specific retrieval (2009)

Meij, Edgar, Trieschnigg, Dolf, Rijke De, Maarten, Kraaij, Wessel

Over the years, various meta-languages have been used to manually enrich documents with conceptual knowledge of some kind. Examples include keyword assignment to citations or, more recently, tags to...

Abstract (2008)

Djoerd Hiemstra, Wessel Kraaij

In this paper we present the language modeling approach to information retrieval as a toolbox to systematically combine information from different sources. Four TREC subtasks (Ad Hoc, Entry Page,...

Graduation committee: Prof. dr. F.M.G. de Jong, promotor (2008)

Wessel Kraaij, Prof Dr. T. Huibers, Prof Dr. W. Jonker, Prof J. Odijk, ...

ter verkrijging van de graad van doctor aan de Universiteit Twente, op gezag van de rector magnificus, prof. dr. F.A. van Vught, volgens besluit van het College voor Promoties in het openbaar te...

TRECVID 2008 - goals, tasks, data, evaluation mechanisms and metrics (2008)

Over, Paul, Awad, George M., Rose, Travis, Fiscus, Jon, Kraaij, Wessel, Smeaton, Alan F.

The TREC Video Retrieval Evaluation (TRECVID) 2008 is a TREC-style video analysis and retrieval evaluation, the goal of which remains to promote progress in content-based exploitation of digital...

TRECVID 2008 - goals, tasks, data, evaluation mechanisms and metrics (2008)

Over, Paul, Awad, George M., Rose, Travis, Fiscus, Jon, Kraaij, Wessel, Smeaton, Alan F.

The TREC Video Retrieval Evaluation (TRECVID) 2008 is a TREC-style video analysis and retrieval evaluation, the goal of which remains to promote progress in content-based exploitation of digital...

Cross-language Retrieval at Twente and TNO. (2008)

Dennis Reidsma, Djoerd Hiemstra, Franciska De Jong, Wessel Kraaij

This paper describes the official runs of the Twenty-One group for CLEF-2002. The Twenty-One group participated in the Dutch and Finnish monolingual and the Dutch bilingual tasks. This paper also...

The TREC Video Retrieval Evaluation (TRECVID) (2008)

Paul Over, Tzveta Ianeva, Wessel Kraaij, Alan F. Smeaton

2006 represents the sixth running of a TREC-style video retrieval evaluation, the goal of which remains to promote progress in content-based retrieval

Abstract (2008)

Dolf Trieschnigg, Wessel Kraaij, Martijn Schuemie, Erasmus Mc

The 2006 TREC Genomics evaluation focuses on document, passage and aspect retrieval in the genomics domain. The Erasmus Medical Center, TNO and University of Twente collaborated on an approach...

Abstract Viewing Stemming as Recall Enhancement (2008)

Wessel Kraaij

Previous research on stemming has shown both positive and negative effects on retrieval performance. This paper describes an experiment in which several linguistic and non-linguistic stemmers are...

Experimental Comparison of Multimodal Meeting Browsers (2008)

Wilfried Post, Erwin Elling, Anita Cremers, Wessel Kraaij

Abstract. This paper describes an experimental comparison of three variants of a meeting browser. This browser incorporates innovative, multimodal technologies to enable storage and smart retrieval...

Graduation committee: Prof. dr. F.M.G. de Jong, promotor (2008)

Wessel Kraaij, Prof Dr. T. Huibers, Prof Dr. W. Jonker, Prof J. Odijk, ...

ter verkrijging van de graad van doctor aan de Universiteit Twente, op gezag van de rector magnificus, prof. dr. F.A. van Vught, volgens besluit van het College voor Promoties in het openbaar te...

Measuring concept relatedness using language models (2008)

Dolf Trieschnigg, Edgar Meij, Maarten De Rijke, Wessel Kraaij

Over the years, the notion of concept relatedness has attracted considerable attention. A variety of approaches, based on ontology structure, information content, association, or context have been...

Measuring concept relatedness using language models (2008)

Trieschnigg, Dolf, Meij, Edgar, Rijke De, Maarten, Kraaij, Wessel

Over the years, the notion of concept relatedness has attracted considerable attention. A variety of approaches, based on ontology structure, information content, association, or context have been...

Comparing the Effect of Syntactic vs. Statistical Phrase Indexing Strategies for Dutch (2007)

Wessel Kraaij, Renée Pohlmann, Ren'ee Pohlmann

. In this paper we describe the results of experiments contrasting syntactic phrase indexing with statistical phrase indexing for Dutch texts. Our results showed that we at least need a compound...

SCHEDULE (2007)

Rik De Busser, Regina Barzilay, Paul Clough, Bruce Croft, Norbert Fuhr, Fredric Gey, ...

In cooperation with: Interdisciplinary Centre for Law and IT Research Group Legal Informatics & Information Retrieval IWT Vlaanderen Proceedings editors

UP LIFT (2007)

Wessel Kraaij, Ren Ee Pohlmann

Project coordinator: Prof. Ir. S.P.J. Landsbergen (OTS/IPO)

multidocument (2007)

Wessel Kraaij, Martijn Spitters, Anette Hulth

extraction based on a combination of uni- and

y (2007)

Wessel Kraaij, Djoerd Hiemstra

This paper describes the ocial runs of the Twenty-One group for TREC-8. The Twenty-One group participated in the Ad-hoc, CLIR, Adaptive Filtering and SDR tracks. The main focus of our experiments is...

Cross-language Retrieval at the University of Twente and TNO. (2007)

Dennis Reidsma, Djoerd Hiemstra, Franciska De Jong, Wessel Kraaij

This paper describes the o#cial runs of the Twente/TNO group for CLEF 2002. We participated in the Dutch and Finnish monolingual and the Dutch bilingual tasks. In addition this paper reports on an...

Bibliography Published Papers by Dragomir R. Radev References (2007)

Dragomir R. Radev, Alfred Aho, Shih-fu Chang, Kathleen Mckeown, Dragomir Radev, Bruce Croft, ...

[5] Suresh Bhavnani, Karen Drabenstott, and Dragomir Radev. Towards a unified framework of IR tasks and strategies. In 2001 ASIST Annual

The Influence of Basic Tokenization on Biomedical Document Retrieval (2007)

Trieschnigg, Dolf, Kraaij, Wessel, Jong De, Franciska

Tokenization is a fundamental preprocessing step in Information Retrieval systems in which text is turned into index terms. This paper quantifies and compares the influence of various simple...

Cross Language Information Retrieval for Biomedical Literature (2007)

Schuemie, Martijn, Trieschnigg, Dolf, Kraaij, Wessel

This workshop report discusses the collaborative work of UT, EMC and TNO on the TREC Genomics Track 2007. The biomedical information retrieval task is approached using cross language methods, in...

Evaluation campaigns and TRECVid (2006)

Smeaton, Alan F., Over, Paul, Kraaij, Wessel

The TREC Video Retrieval Evaluation (TRECVid) is an international benchmarking activity to encourage research in video information retrieval by providing a large test collection, uniform scoring...

Evaluation campaigns and TRECVid (2006)

Smeaton, Alan F., Over, Paul, Kraaij, Wessel

The TREC Video Retrieval Evaluation (TRECVid) is an international benchmarking activity to encourage research in video information retrieval by providing a large test collection, uniform scoring...

Concept based passage retrieval for genomics literature (2006)

Trieschnigg, Dolf, Kraaij, Wessel, Schuemie, Martijn

The 2006 TREC Genomics evaluation focuses on document, passage and aspect retrieval in the genomics domain. The Erasmus Medical Center, TNO and University of Twente collaborated on an approach...

The AMI Meeting Corpus: a Pre-Announcement (2005)

Carletta, Jean, Ashby, Simone, Bourban, Sebastien, Flynn, Mike, Guillemot, Mael, Hain, Thomas, ...

The AMI Meeting Corpus is a multi-modal data set consisting of 100 hours of meeting recordings. It is being created in the context of a project that is developing meeting browsing technology and will...

The ami meeting corpus: a pre-announcement (2005)

Carletta, Jean, Ashby, Simone, Bourban, Sebastien, Flynn, Mike, Guillemot, Mael, Hain, Thomas, ...

The AMI Meeting Corpus is a multi-modal data set consisting of 100 hours of meeting recordings. It is being created in the context of a project that is developing meeting browsing technology and will...

The AMI meeting corpus: A pre-announcement (2005)

Jean Carletta, Simone Ashby, Sebastien Bourban, Mike Flynn, Thomas Hain, Jaroslav Kadlec, ...

Abstract. The AMI Meeting Corpus is a multi-modal data set consisting of 100 hours of meeting recordings. It is being created in the context of a project that is developing meeting browsing...

Trecvid 2005 - an overview (2005)

Paul Over, Tzveta Ianeva, Wessel Kraaij, Alan F. Smeaton

TRECVID 2005 represented the fifth running of a TREC-style video retrieval evaluation, the goal of which remained to promote progress in content-based retrieval from digital video via open,...

TRECVID: evaluating the effectiveness of information retrieval tasks on digital video (2004)

Smeaton, Alan F., Over, Paul, Kraaij, Wessel

TRECVID is an annual exercise which encourages research in information retrieval from digital video by providing a large video test collection, uniform scoring procedures, and a forum for...

TRECVID: evaluating the effectiveness of information retrieval tasks on digital video (2004)

Smeaton, Alan F., Over, Paul, Kraaij, Wessel

TRECVID is an annual exercise which encourages research in information retrieval from digital video by providing a large video test collection, uniform scoring procedures, and a forum for...

TREC video retrieval evaluation: a case study and status report (2004)

Smeaton, Alan F., Kraaij, Wessel, Over, Paul

The TREC Video Retrieval Evaluation is a multiyear, international effort, funded by the US Advanced Research and Development Agency (ARDA) and the National Institute of Standards and Technology...

TREC video retrieval evaluation: a case study and status report (2004)

Smeaton, Alan F., Kraaij, Wessel, Over, Paul

The TREC Video Retrieval Evaluation is a multiyear, international effort, funded by the US Advanced Research and Development Agency (ARDA) and the National Institute of Standards and Technology...

Variations on language modeling for information retrieval (2004)

Kraaij, Wessel

Search engine technology builds on theoretical and empirical research results in the area of information retrieval (IR). This dissertation makes a contribution to the field of language modeling (LM)...

Variations on language modeling for information retrieval (2004)

Kraaij, Wessel

Search engine technology builds on theoretical and empirical research results in the area of information retrieval (IR). This dissertation makes a contribution to the field of language modeling (LM)...

Variations on language modeling for information retrieval (2004)

Kraaij, Wessel

Search engine technology builds on theoretical and empirical research results in the area of information retrieval (IR). This dissertation makes a contribution to the field of language modeling (LM)...

Automatic summarization of meeting data: A feasibility study (2004)

Anne Hendrik Buist, Wessel Kraaij, Stephan Raaijmakers

The disclosure of audio-visual meeting recordings is a new challenging domain studied by several large scale research projects in Europe and the US. Automatic meeting summarization is one of the...

Transitive probabilistic CLIR models (2004)

Wessel Kraaij, Franciska De Jong

Transitive translation could be a useful technique to enlarge the number of supported language pairs for a cross-language information retrieval (CLIR) system in a cost-effective manner. The paper...

Transitive probabilistic CLIR models (2004)

Wessel Kraaij, Franciska De Jong

Transitive translation could be a useful technique to enlarge the number of supported language pairs for a cross-language information retrieval (CLIR) system in a cost-effective manner. The paper...

Embedding Web-based Statistical Translation Models in Cross-Language Information Retrieval (2003)

Kraaij, Wessel, Nie, Jian-Yun, Simard, Michel

Although more and more language pairs are covered by machine translation services, there are still many pairs that lack translation resources. Cross-language information retrieval (CLIR) is an...

The TREC Video Retrieval Evaluation (TRECVID): A Case Study and Status Report (2003)

Alan F. Smeaton, Wessel Kraaij, Paul Over

The TREC Video Retrieval Evaluation (TRECVID) is an annual international e#ort, funded by the US Advanced Research and Development Agency (ARDA) and the National Institute of Standards and Technology...

Embedding web-based statistical translation models in cross-language information retrieval (2003)

Wessel Kraaij, Jian-yun Nie, Michel Simard

Although more and more language pairs are covered by machine translation (MT) services, there are still many pairs that lack translation resources. Cross-language information retrieval (CLIR) is an...

Embedding web-based statistical translation models in cross-language information retrieval (2003)

Wessel Kraaij, Jian-yun Nie, Michel Simard

Although more and more language pairs are covered by machine translation services, there are still many pairs that lack translation resources. Cross-language information retrieval (CLIR) is an...

Cross-language Retrieval at the University of Twente and TNO (2003)

Reidsma, Dennis, Hiemstra, Djoerd, Jong De, Franciska, Kraaij, Wessel

This paper describes the official runs of the Twente/TNO group for CLEF 2002. We participated in the Dutch and Finnish monolingual and the Dutch bilingual tasks. In addition this paper reports on an...

The importance of prior probabilities for entry page search (2002)

Wessel Kraaij

An important class of searches on the world-wide-web has the goal to find an entry page (homepage) of an organisation. Entry page search is quite different from Ad Hoc search. Indeed a plain Ad Hoc...

The importance of prior probabilities for entry page search (2002)

Wessel Kraaij

An important class of searches on the world-wide-web has the goal to find an entry page (homepage) of an organisation. Entry page search is quite different from Ad Hoc search. Indeed a plain Ad Hoc...

The importance of prior probabilities for entry page search (2002)

Wessel Kraaij

An important class of searches on the world-wide-web has the goal to find an entry page (homepage) of an organisation. Entry page search is quite different from Ad Hoc search. Indeed a plain Ad Hoc...

Retrieving Web pages using content, links, URLs and anchors (2001)

Thijs Westerveld, Wessel Kraaij, Djoerd Hiemstra

Abstract. For this year’s web track, we concentrated on the entry page finding task. For the content-only runs, in both the ad-hoc task and the entry page finding task, we used an information...

TNO at CLEF-2001: Comparing translation resources (2001)

Wessel Kraaij

This paper describes the official runs of TNO TPD for CLEF-2001. We participated in the monolingual, bilingual and multilingual tasks. The main contribution of this paper is a systematic comparison...

Retrieving Web pages using content, links, URLs and anchors (2001)

Thijs Westerveld, Wessel Kraaij, Djoerd Hiemstra

Abstract. For this year’s web track, we concentrated on the entry page finding task. For the content-only runs, in both the ad-hoc task and the entry page finding task, we used an information...

Different approaches to cross language information retrieval (2001)

Wessel Kraaij, Ren Ee Pohlmann

This paper describes two experiments in the domain of Cross Language Information Retrieval. Our basic approach is to translate queries word by word using machine readable dictionaries. The first...

Twenty-one at clef-2000: Translation resources, merging strategies and relevance feedback (2001)

Ý Djoerd Hiemstra, Wessel Kraaij, Renée Pohlmann, Ý Thijs Westerveld

This paper describes the official runs of the Twenty-One group for CLEF-2000. The Twenty-One group participated in the monolingual, bilingual and multilingual tasks. The following new techniques are...

Combining a mixture language model and Naive Bayes for (2001)

Wessel Kraaij, Martijn Spitters

The TNO system for multi-document summarisation is based on an extraction approach. We combined two statistical methods for sentence selection with a variant of the MMR algorithm. After sentence...

TNO at CLEF-2001: Comparing translation resources (2001)

Wessel Kraaij

This paper describes the official runs of TNO TPD for CLEF-2001. We participated in the monolingual, bilingual and multilingual tasks. The main contribution of this paper is a systematic comparison...

A Language Modeling Approach to Tracking News Events (2000)

Martijn Spitters, Wessel Kraaij

This paper presents the TNO tracking system for the 2000 Topic Detection and Tracking evaluation project (TDT2000). The objective of the TDT tracking task is to track events of interest over time....

Twenty-One at TREC-7: Ad-hoc and Cross-Language Track (1999)

Djoerd Hiemstra, Wessel Kraaij

This paper describes the o cial runs of the Twenty-One group for TREC-7. The Twenty-One group participated in the ad-hoc and the cross-language track and made the following accomplishments: We...

Twenty-One at TREC-7: Ad-hoc and Cross-language track (1999)

Djoerd Hiemstra, Wessel Kraaij

This paper describes the official runs of the Twenty-One group for TREC-7. The Twenty-One group participated in the ad-hoc and the cross-language track and made the following accomplishments: We...

TNO TREC7 site report: SDR and filtering (1999)

Rudie Ekkelenkamp Wessel, Wessel Kraaij, David Van Leeuwen

This paper reports about experiments in the CLIR and filtering track, carried out at TNO-TPD and TNO-TM. TNO-TPD is also a member of the TwentyOne consortium and as such participated in the AdHoc...

Twenty-One at TREC-7: Ad-hoc and Cross-Language Track (1999)

Djoerd Hiemstra, Wessel Kraaij

This paper describes the o cial runs of the Twenty-One group for TREC-7. The Twenty-One group participated in the ad-hoc and the cross-language track and made the following accomplishments: We...

Cross Language Retrieval with the Twenty-One system (1998)

Wessel Kraaij, Djoerd Hiemstra

The EU project Twenty-One will support cross language queries in a multilingual document base. A prototype version of the Twenty-One system has been subjected to the Cross Language track tests in...

Cross Language Retrieval with the Twenty-One system (1998)

Wessel Kraaij, Djoerd Hiemstra

The EU project Twenty-One will support cross language queries in a multilingual document base. A prototype version of the Twenty-One system has been subjected to the Cross Language track tests in...

The effect of syntactic phrase indexing on retrieval performance for Dutch texts (1997)

Renée Pohlmann, Wessel Kraaij

In this paper we describe an experiment with syntactic phrase indexing for Dutch texts. We compare different choices for combining terms to form head-modifier pairs and we also investigate the effect...

The EU project ’Twenty-One’ and cross-language IR (1997)

Wessel Kraaij

TwentyOne is a EU funded project which aims at developing advanced indexing and retrieval techniques for multimedia document bases. The document base consists of documents in four languages: Dutch,...

The Effect of Syntactic Phrase Indexing on Retrieval Performance for Dutch Texts (1997)

Renée Pohlmann, Wessel Kraaij

In this paper we describe an experiment with syntactic phrase indexing for Dutch texts. We compare different choices for combining terms to form head-modifier pairs and we also investigate the effect...

Multilingual functionality in the TwentyOne project (1997)

Wessel Kraaij

TwentyOne is a EU funded project which aims at developing advanced indexing and retrieval techniques for multimedia document bases. The document base consists of documents in four languages: Dutch,...

A Domain Specific Lexicon Acquisition Tool for Cross-Language Information Retrieval (1997)

Djoerd Hiemstra, Franciska De Jong, Wessel Kraaij

With the recent enormous increase of information dissemination via the web as incentive there is a growing interest in supporting tools for cross-language retrieval. In this paper we describe a...

The EU project ’Twenty-One’ and cross-language IR (1997)

Wessel Kraaij

TwentyOne is a EU funded project which aims at developing advanced indexing and retrieval techniques for multimedia document bases. The document base consists of documents in four languages: Dutch,...

Viewing Stemming as Recall Enhancement (1996)

Wessel Kraaij, Ren Ee Pohlmann

Previous research on stemming has shown both positive and negative effects on retrieval performance. This paper describes an experiment in which several linguistic and non-linguistic stemmers are...

Viewing Stemming as Recall Enhancement (1996)

Wessel Kraaij, Renee Pohlmann

Previous research on stemming has shown both positive and negative effects on retrieval performance. This paper describes an experiment in which several linguistic and non-linguistic stemmers are...

Evaluation of a Dutch stemming algorithm (1995)

Wessel Kraaij, Renée Pohlmann

This paper describes the development and evaluation of a suffix stripper for Dutch. We have chosen to modify the stemming algorithm developed by Porter (1980) because it is well known and is...

Evaluation of a Dutch Stemming Algorithm (1995)

Wessel Kraaij, Renée Pohlmann

In state of the art Information Retrieval (IR) systems the most salient problem is to improve recall rates while retaining high precision. A simple recall enhancing technique which can be useful for...

Porter's stemming algorithm for Dutch (1994)

Wessel Kraaij, Renée Pohlmann

A stemming algorithm provides a simple means to enhance Recall in Text Retrieval systems. The paper describes the development of a Dutch version of the Porter stemming algorithm. The stemmer was...

Improving the Precision of a Text Retrieval System with Compound Analysis

Renée Pohlmann, Wessel Kraaij

In this paper we describe research on compound analysis in the UPLIFT information retrieval project. Results of earlier experiments indicated that splitting up compounds in the query and forming new...

Improving the Precision of a Text Retrieval System with Compound Analysis

Ren'ee Pohlmann, Wessel Kraaij

In this paper we describe research on compound analysis in the UPLIFT information retrieval project. Results of earlier experiments indicated that splitting up compounds in the query and forming new...

MeSH Up: effective MeSH text classification for improved document retrieval

Trieschnigg, Dolf, Pezik, Piotr, Lee, Vivian, De Jong, Franciska, Kraaij, Wessel, Rebholz-Schuhmann, Dietrich

Motivation: Controlled vocabularies such as the Medical Subject Headings (MeSH) thesaurus and the Gene Ontology (GO) provide an efficient way of accessing and organizing biomedical information by...