Does patient satisfaction of general practice change over a decade? (2009)
Allan, James, Schattner, Peter, Stocks, Nigel, Ramsay, Emmae
Background: The Patient Participation Program (PPP) was a patient satisfaction survey endorsed by the Royal Australian College of General Practitioners and designed to assist general...
A Comparison of Statistical Significance Tests for Information Retrieval Evaluation (2009)
Mark D. Smucker, James Allan, Ben Carterette
Information retrieval (IR) researchers commonly use three tests of statistical significance: the Student’s paired t-test, the Wilcoxon signed rank test, and the sign test. Other researchers have...
James Allan, Victor Lavrenko, Ramesh Nallapati
University of Massachusetts participated in all four tasks of the TDT 2002 evaluation. As our primary submission we used the system based on the vector-space model of Information Retrieval. The...
Classification Models for New Event Detection (2008)
Giridhar Kumaran, James Allan, Andrew McCallum
New event detection (NED) involves monitoring news streams to detect the stories that report on new events. In this paper we explore the application of machine learning classification techniques for...
Mark D. Smucker, James Allan, Blagovest Dachev
Every day, people widely use information retrieval (IR) systems to find documents that answer their questions. Compared to these IR systems, question answering (QA) systems aim to speed the rate at...
Unsupervised Non-topical Classification of Documents (2008)
Ron Bekkerman, Koji Eguchi, James Allan
We describe the problem of non-topical clustering of documents, the purpose of which is to divide a set of documents into clusters that share some aspect. We present experiments on the British...
Mark D. Smucker, James Allan, Blagovest Dachev
Every day, people widely use information retrieval (IR) systems to answer their questions. We utilized the TREC 2007 complex, interactive question answering (ciQA) track to measure the performance of...
Shlomo Zilberstein, Principal Investigator, James Allan, Co-principal Investigator
This project is a collaboration between the Information Retrieval Center and the Resource-Bounded Reasoning Lab at the University of Massachusetts. It is aimed at developing a dynamic approach for...
Discovering Missing Values in Semi-Structured Databases (2008)
Xing Yi, James Allan, Victor Lavrenko
We explore the problem of discovering multiple missing values in a semi-structured database. For this task, we formally develop Structured Relevance Model (SRM) built on one hypothetical generative...
Automatic Hypertext Link Typing (2008)
We present entirely automatic methods for gathering documents for a hypertext, linking the set, and annotating those connections with a description of the type (i.e., nature) of the link. Document...
Event Threading within News Topics (2008)
Ramesh Nallapati, Ao Feng, Fuchun Peng, James Allan
With the overwhelming volume of online news available today, there is an increasing need for automatic techniques to analyze and present news to the user in a meaningful and efficient manner....
Relevance Models for Topic Detection and Tracking (2008)
Victor Lavrenko, James Allan, Edward Deguzman, Daniel Laflamme, Veera Pollard, Steven Thomas
We extend relevance modeling to the link detection task of Topic Detection and Tracking (TDT) and show that it substantially improves performance. Relevance modeling, a statistical language modeling...
INQUERY and TREC-7 (DRAFT|Workshop notebook version) (2008)
James Allan, Jamie Callan, Mark S, Jinxi Xu, Steven Wegmann
in only four of the tracks that were part of the TREC-7 workshop. We worked on ad-hoc retrieval, filtering, VLC, and the SDR track. This report covers the work done on each track successively. We start...
Bibliography Published Papers by Dragomir R. Radev References (2008)
Dragomir R. Radev, Alfred Aho, Shih-fu Chang, Kathleen Mckeown, Dragomir Radev, James Allan, ...
retrieval and language modeling. SIGIR Forum, 37(1), March
on such a fine grained level has been started ... (2007)
Vikash Kh, Rahul Gupta, James Allan
In recent years a lot of progress has been made in the
Flexible Intrinsic Evaluation of Hierarchical Clustering for TDT (2007)
James Allan, Ao Feng, Alvaro Bolivar
The Topic Detection and Tracking (TDT) evaluation program has included a “cluster detection” task since its inception in 1996. Systems were required to process a stream of broadcast news stories...
Relevance Models for Topic Detection and Tracking (2007)
Victor Lavrenko, James Allan, Edward Deguzman, Daniel Laflamme, Veera Pollard, Stephen Thomas
We extend relevance modeling to the link detection task of Topic Detection and Tracking (TDT) and show that it substantially improves performance. Relevance modeling, a statistical language modeling...
Using Text Categorization to Measure the Impact of News on Public Opinion (2007)
James Allan, Alvaro Bolivar, Courtney Wade
Public opinion researchers have investigated the question of whether news reporting affects the results of opinion polls that ask for the most important problem facing the nation. Those studies have...
Abstract. In this paper we investigate a general purpose interactive information organization system. The system organizes documents by placing them into 1-, 2-, or 3-dimensional space based on their...
James Allan, Jaime Carbonell, George Doddington, Jonathan Yamron, Yiming Yang
Topic Detection and Tracking (TDT) is a DARPA-sponsored initiative to investigate the state of the art in finding and following new events in a stream of broadcast news stories. The TDT problem...
Paul van Mulbregt, Dragon (2007)
James Allan, Jaime Carbonell, George Doddington, Jonathan Yamron, Yiming Yang, ...
Topic Detection and Tracking (TDT) is a DARPA-sponsored initiative to investigate the state of the art in finding and following new events in a stream of broadcast news stories. The TDT problem...
Text Alignment with Handwritten Documents (2007)
E. Micah Kornfield, R. Manmatha, James Allan
Today's digital libraries increasingly include not only printed text but also scanned handwritten pages and other multimedia material. There are, however, few tools available for manipulating...
Extraction of Key Words from News Stories (2007)
Ramesh Nallapati, James Allan, Sridhar Mahadevan
In this work, we consider the task of extracting key-words such as key-players, key-locations, key-nouns and key-verbs from news stories. We cast this problem as a classification problem wherein we...
HARD Track Overview in TREC 2003 (2007)
James Allan
Introduction The effectiveness of ad-hoc retrieval systems appears to have reached a plateau. After several years of 10% gains every year in TREC, improvements dwindled or even stopped. This lack of...
Research methodology in studies of assessor effort for retrieval evaluation (2007)
As evaluation is an important but difficult part of information retrieval system design and experimentation, evaluation questions have been the subject of much research. An “evaluation study” is...
An investigation of Dirichlet prior smoothing’s performance advantage (2005)
In the language modeling approach to information retrieval, Dirichlet prior smoothing frequently outperforms Jelinek-Mercer smoothing. Both Dirichlet prior and Jelinek-Mercer are forms of linear...
Dirichlet Mixtures for Query Estimation in Information Retrieval (2005)
Treated as small samples of text, user queries require smoothing to better estimate the probabilities of their true model. Traditional techniques to perform this smoothing include automatic query...
HARD track overview in TREC 2004 - High Accuracy Retrieval from Documents (2005)
The HARD track of TREC 2004 aims to improve the accuracy of information retrieval through the use of three techniques: (1) query metadata that better describes the information need, (2) focused and...
When less is more: Relevance feedback falls short and term expansion succeeds at HARD 2005 (2005)
We used clarification forms to study passage term feedback. When compared against pseudo-relevance feedback with an extremely large external corpus, we found that passage feedback resulted in a...
When will information retrieval be “good enough”? (2005)
James Allan, Ben Carterette, Joshua Lewis
We describe a user study that examined the relationship between the quality of an Information Retrieval system and the effectiveness of its users in performing a task. The task involves finding...
Incremental test collections (2005)
Corpora and topics are readily available for information retrieval research. Relevance judgments, which are necessary for system evaluation, are expensive; the cost of obtaining them prohibits...
DNA methylation, nucleosome formation and positioning (2005)
Pennings, Sari, Allan, James, Davey, Colin S.
Recent mapping of nucleosome positioning on several long gene regions subject to DNA methylation has identified instances of nucleosome repositioning by this base modification. The evidence for an...
Restrictive Clustering and Metaclustering for self-organizing Document Collections (2004)
Siersdorfer, Stefan, Sizov, Sergej, Järvelin, Kalervo, Allan, James, Bruza, Peter, Sanderson, Mark
This paper addresses the problem of automatically structuring heterogeneous document collections by using clustering methods. In contrast to traditional clustering, we study restrictive methods and...
Abstract. A web-based search engine responds to a user’s query with a list of documents. This list can be viewed as the engine’s model of the user’s idea of relevance -- the engine ‘believes’...
UMass at TREC 2004: Notebook (2004)
Nasreen Abdul-jaleel, James Allan, W. Bruce Croft, O Diaz, Leah Larkey, Xiaoyan Li, ...
The retrieval model implemented in the Indri search engine is an enhanced version of the model described in [30], which combines the language modeling [35] and inference network [38] approaches to...
An exploration of entity models, collective classification and relation description (2004)
Hema Raghavan, James Allan, Andrew McCallum
Traditional information retrieval typically represents data using a bag of words; data mining typically uses a highly structured database representation. This paper explores the middle ground using a...
Dynamic composition of information retrieval techniques (2004)
Abstract. This paper presents a new approach to information retrieval (IR) based on run-time selection of the best set of techniques to respond to a given query. A technique is selected based on its...
NLP for IR - Natural Language Processing for Information Retrieval (2004)
s are reduced forms of text -- NLP may not be useful in extracting good features since it has already been done n 1990s saw arrival of TREC workshops -- Large collections (gigabytes) -- Full-text...
UMass at TREC 2003: HARD and QA (2004)
Nasreen Abduljaleel, Andres Corrada-emmanuel, Qi Li, Xiaoyong Liu, Courtney Wade, James Allan
• In the HARD track, we developed document metadata to correspond to query metadata requirements; implemented clarification forms based on query expansion, passage retrieval, and clustering; and...
UMass at TREC 2004: Novelty and HARD (2004)
Nasreen Abdul-jaleel, James Allan, W. Bruce Croft, O Diaz, Leah Larkey, Xiaoyan Li, ...
For the TREC 2004 Novelty track, UMass participated in all four tasks. Although finding relevant sentences was harder this year than last, we continue to show marked improvements over the baseline of...
Margaret Connell, Ao Feng, Giridhar Kumaran, Hema Raghavan, Chirag Shah, James Allan
submitted runs for all four tasks, namely, Hierarchical Topic Detection,
Using Bigrams in Text Categorization (2004)
In the past decade a sufficient effort has been expended on attempting to come up with a document representation which is richer than the simple Bag-Of-Words (BOW). One of the widely explored...
A user-centered approach to evaluating topic models (2004)
Diane Kelly, O Diaz, Nicholas J. Belkin, James Allan
Abstract. This paper evaluates the automatic creation of personal topic models using two language model-based clustering techniques. The results of these methods are compared with user-defined topic...
A determining influence for CpG dinucleotides on nucleosome positioning in vitro (2004)
Davey, Colin S., Pennings, Sari, Reilly, Carmel, Meehan, Richard R., Allan, James
DNA sequence information that directs the translational positioning of nucleosomes can be attenuated by cytosine methylation when a short run of CpG dinucleotides is located close to the dyad axis of...
Robust Techniques for Organizing and Retrieving Spoken Documents (2003)
Information retrieval tasks such as document retrieval and topic detection and tracking (TDT) show little degradation when applied to speech recognizer output. We claim that the robustness of the...
An adaptive local dependency language model: Relaxing the naive Bayes assumption (2003)
We describe a new probabilistic approach in the language modeling framework that captures adaptively the local term dependencies in documents. The new model works by boosting scores of documents that...
Details on Stemming in the Language Modeling Framework (2003)
We incorporate stemming into the language modeling framework. The work is suggested by the notion that stemming increases the numbers of word occurrences used to estimate the probability of a word...
Browsing-based User Language Models for Information Retrieval (2003)
Traditional information retrieval systems have ignored the potential improvement in precision provided by personalization. We present a study of the behavior and evaluation of personalized...
In this article the author reviews the recent exchange in this Journal between David Dyzenhaus and Matthew Kramer on the merits of legal positivism. He then offers his own modest proposal regarding...
Sympathy and Antipathy: Essays Legal and Philosophical (2002)
The search for a moral standard of right and wrong which is external to any particular evaluator, and so escapes subjectivity, has a long history. Jeremy Bentham, in trying to find such a standard,...
UMass at TREC 2002: Cross language and novelty tracks (2002)
Leah S. Larkey, James Allan, Margaret E. Connell, Alvaro Bolivar, Courtney Wade
The University of Massachusetts participated in the cross-language and novelty tracks this year. The cross-language submission was...
Perspectives on information retrieval and speech (2002)
Abstract. Several years of research have suggested that the accuracy of spoken document retrieval systems is not adversely affected by speech recognition errors. Even with error rates of around 40%,...
Knowledge Management and Speech recognition (2002)
Knowledge Management (KM) generally refers to techniques that allow an organization to capture information and practices of its members and customers, in order that the stored knowledge can be...
UMass at TREC 2002: Cross language and novelty tracks (2002)
Leah S. Larkey, James Allan, Margaret E. Connell, Alvaro Bolivar, Courtney Wade
The University of Massachusetts participated in the cross-language and novelty tracks this year. The cross-language submission was characterized by combination of evidence to merge results from two...
Topic models for summarizing novelty (2001)
James Allan, Rahul Gupta, Vikas Kh
We define temporal summaries of news stories as extracting as few sentences as possible from each event within a news topic, where the stories are presented one at a time and sentences from a story...
Monitoring the News: a TDT demonstration system (2001)
Victor Lavrenko, Anton Leuski, James Allan
We describe a demonstration system built upon Topic Detection and Tracking (TDT) technology. The demonstration system monitors a stream of news stories, organizes them into clusters that represent...
Mining of concurrent text and time series (2000)
Victor Lavrenko, Matt Schmill, Dawn Lawrie, Paul Ogilvie, David Jensen, James Allan
Language models for financial news recommendation (2000)
Victor Lavrenko, Matt Schmill, Dawn Lawrie, Paul Ogilvie, David Jensen, James Allan
James Allan, Victor Lavrenko, David Frey, Vikas Kh
We spent a fair amount of time this year rewriting our TDT system in order to provide more flexibility and to better integrate the various components. The time spent rearchitecting the code, learning...
Improving interactive retrieval by combining ranked lists and clustering (2000)
We study the problem of organizing the documents returned by an information retrieval system in response to a natural language query. We consider two well-known approaches – the ranked list and...
Language models for financial news recommendation (2000)
Victor Lavrenko, Matt Schmill, Dawn Lawrie, Paul Ogilvie, David Jensen, James Allan
We present a unique approach to identifying news stories that influence the behavior of financial markets. Specifically, we describe the design and implementation of AEnalyst, a system that can recommend...
Mining of concurrent text and time series (2000)
Victor Lavrenko, Matt Schmill, Dawn Lawrie, Paul Ogilvie, David Jensen, James Allan
We present a unique approach to identifying news stories that influence the behavior of financial markets. We describe the design and implementation of AEnalyst, a system for predicting trends in stock...
Lighthouse is an on-line interface for a Web-based information retrieval system. It integrates two known presentations of the retrieved results -- the ranked list and clustering visualization -- in a...
Lighthouse: Showing the Way to Relevant Information (2000)
Lighthouse is an on-line interface for a Web-based information retrieval system. It accepts queries from a user, collects the retrieved documents from the search engine, organizes and presents them...
Improving Interactive Retrieval by Combining Ranked Lists and Clustering (2000)
We study the problem of organizing the documents returned by an information retrieval system in response to a natural language query. We consider two well-known approaches -- the ranked list and...
Mining of concurrent text and time series (2000)
Victor Lavrenko, Matt Schmill, Dawn Lawrie, Paul Ogilvie, David Jensen, James Allan
We present a unique approach to identifying news stories that influence the behavior of financial markets. We describe the design and implementation of AEnalyst, a system for predicting...
Colin Woodard Barringer, University of Massachusetts Amherst, Macintosh Os, X Profs, James Allan, Stephen Forrest, ...
Research position on projects related to motor control, machine learning or computational neuroscience.
UMASS Approaches to Detection and Tracking at TDT2 (1999)
Ron Papka, James Allan, Victor Lavrenko
The following work describes our solutions to the detection and tracking problems defined by the Topic Detection and Tracking (TDT2) research initiative. We discuss the implementation and results of...
Extracting Significant Time Varying Features from Text (1999)
We propose a simple statistical model for the frequency of occurrence of features in a stream of text. Adoption of this model allows us to use classical significance tests to filter the stream for...
Topic detection and tracking pilot study: Final report (1998)
James Allan, Jaime Carbonell, George Doddington, Jonathan Yamron, Yiming Yang, ...
Topic Detection and Tracking (TDT) is a DARPA-sponsored initiative to investigate the state of the art in finding and following new events in a stream of broadcast news stories. The...
Topic detection and tracking pilot study: Final report (1998)
James Allan, Jaime Carbonell, George Doddington, Jonathan Yamron, Yiming Yang, ...
Topic Detection and Tracking (TDT) is a DARPA-sponsored initiative to investigate the state of the art in finding and following new events in a stream of broadcast news stories. The TDT problem...
Visual Interactions with a Multidimensional Ranked List (1998)
Anton Leouski, James Allan
begin with, the documents are represented as vectors of terms. These vectors are huge: their size is equal to the vocabulary size of the retrieved set. For each document its vector defines a point in...
Evaluating a Visual Information Retrieval Interface: AspInquery at TREC-6 (1998)
Russell Swan, James Allan, Don Byrd
We built two Information Retrieval systems that were targeted for the TREC-6 "aspect oriented" retrieval track. The systems were built to test the usefulness of different visualizations in...
Visual Interactions with a Multidimensional Ranked List (1998)
Performance analysis of an interactive visualization system generally requires an extensive user study, a method that is very expensive and that often yields inconclusive results. To do a successful...
On-Line New Event Detection using Single Pass Clustering (1998)
This paper discusses the implementation and evaluation of a new-event detection system. We focus on a strict on-line setting, in that the system must indicate whether the current document contains or...
Document Classification using Multiword Features (1998)
We investigate the use of multiword query features to improve the effectiveness of text-retrieval systems that accept natural-language queries. A relevance feedback process is explained that expands...
Aspect Windows, 3-D Visualizations, and Indirect Comparisons of Information Retrieval Systems (1998)
We built two Information Retrieval systems that were targeted for the TREC-6 "aspect oriented" retrieval track. The systems were built to test the usefulness of different visualizations in...
On-line New Event Detection and Tracking (1998)
James Allan, Ron Papka, Victor Lavrenko
We define and describe the related problems of new event detection and event tracking within a stream of broadcast news stories. We focus on a strict on-line setting---i.e., the system must make...
Evaluating a Visual Navigation System for a Digital Library (1998)
In this paper we investigate a general purpose interactive information organization system. The system organizes documents by placing them into 1-, 2-, or 3-dimensional space based on their...
James Allan, Victor Lavrenko, Ron Papka
This paper introduces Event Tracking, a new application of Information Retrieval technology with interesting research and evaluation questions. We describe the problem, a pilot corpus of news...
Topic Detection and Tracking Pilot Study (1998)
James Allan, Jaime Carbonell, George Doddington, Jonathan Yamron, Yiming Yang, ...
Topic Detection and Tracking (TDT) is a DARPA-sponsored initiative to investigate the state of the art in finding and following new events in a stream of broadcast news stories. The TDT problem...
James Allan, Jamie Callan, Mark S, Jinxi Xu, Steven Wegmann
in only four of the tracks that were part of the TREC-7 workshop. We worked on ad-hoc retrieval, filtering, VLC, and the SDR track. This report covers the work done on each track successively. We start...
Interactive cluster visualization for information retrieval (1997)
Abstract. In this paper we investigate a general purpose interactive information organization system. The system organizes documents by placing them into 1-, 2-, or 3-dimensional space based on their...
TREC-6 Interactive Track Report, Part 1: Experimental Procedure and Initial Results (1997)
Don Byrd, Russell Swan, James Allan
The TREC-6 Interactive Track concentrated on Aspect-Oriented Retrieval: finding at least one document that covers each aspect of relevance to a given topic, as opposed to the usual...
Why Bigger Windows Are Better Than Smaller Ones (1997)
Ron Papka, James Allan
We investigate the use of multi-term query concepts to improve the performance of text-retrieval systems that accept "natural-language" queries. A relevance feedback process is explained...
Interactive Cluster Visualization for Information Retrieval (1997)
James Allan, Anton V. Leouski, Russell C. Swan
This study investigates the ability of cluster visualization to help a user rapidly identify relevant documents. It provides added support for the truth of the Cluster Hypothesis on retrieved...
TREC-6 Interactive Track Report - Part 1: Experimental Procedure and. . . (1997)
Don Byrd, Russell Swan, James Allan
Introduction The TREC-6 Interactive Track concentrated on Aspect-Oriented Retrieval: finding at least one document that covers each aspect of relevance to a given topic, as opposed to the usual...
Automatic Hypertext Link Typing (1996)
We present entirely automatic methods for gathering documents for a hypertext, linking the set, and annotating those connections with a description of the type (i.e., nature) of the link. Document...
Does Navigation Require More than One Compass? (1996)
We present a methodology for automatically constructing structural hyperlinks for navigation and retrieval in large corpora. A structural hyperlink connects components of a document that have...
Incremental Relevance Feedback for Information Filtering (1996)
We use data from the TREC routing experiments to explore how relevance feedback can be applied incrementally --- using a few judged documents each time --- to achieve results that are as good as if...
Improving Interactive Information Retrieval Effectiveness with 3-D Graphics (1996)
Recall-oriented information retrieval requires locating as many documents as possible that are relevant to a query. Traditional information retrieval systems present results only in a ranked list....
Does Navigation Require More than One Compass? (1996)
We present a methodology for automatically constructing structural hyperlinks for information navigation and retrieval in distributed corpora. A structural hyperlink connects components of a document...
Automatic Hypertext Construction (1995)
The unprecedented growth of the World Wide Web illustrates the importance of hypertext as a method for organizing the rapidly expanding amount of on-line text. As document collections become larger...
Relevance Feedback With Too Much Data (1995)
Modern text collections often contain large documents which span several subject areas. Such documents are problematic for relevance feedback since inappropriate terms can easily be chosen. This...
Recent Experiments with INQUERY (1995)
James Allan, Lisa Ballesteros, James P. Callan, W. Bruce Croft, Zhihong Lu
this paper focuses on relevant differences to the previously published algorithms. 1 Description of Ad-Hoc Experiments
Selective Text Utilization and Text Traversal (1995)
Many large collections of full-text documents are currently stored in machine-readable form and processed automatically in various ways. These collections may include different types of documents,...
Automatic Text Decomposition and Structuring (1994)
Sophisticated text similarity measurements are used to determine relationships between natural-language texts and text segments. The resulting linked hypertext maps are used to identify different text...
The Effect of Adding Relevance Information in a Relevance Feedback Environment (1994)
Chris Buckley, Gerard Salton, James Allan
The effects of adding information from relevant documents are examined in the TREC routing environment. A modified Rocchio relevance feedback approach is used, with a varying number of relevant...
Automatic Routing and Ad-hoc Retrieval Using SMART : TREC 2 (1994)
Chris Buckley, James Allan, Gerard Salton
The Smart information retrieval project emphasizes completely automatic approaches to the understanding and retrieval of large quantities of text. We continue our work in the TREC 2 environment,...
The effect of adding relevance information in a relevance feedback environment (1994)
Chris Buckley, Gerard Salton, James Allan
The effects of adding information from relevant documents are examined in the TREC routing environment. A modified Rocchio relevance feedback approach is used, with a varying number of relevant...
Selective Text Utilization and Text Traversal (1993)
Many large collections of full-text documents are currently stored in machine-readable form and processed automatically in various ways. These collections may include different types of documents,...
Approaches to Passage Retrieval in Full Text Information Systems (1993)
Salton, Gerard, Allan, James, Buckley, Chris
Large collections of full-text documents are now commonly used in automated information retrieval. When the stored document texts are long, the retrieval of complete documents may not be in the...
Hume and reason: a sceptical theory of morality and law (1993)
Thesis (Ph. D.)--University of Hong Kong, 1994.
Information Agents for Building Hyperlinks (1993)
James Allan, Jim Davis, Dean Krafft, Daniela Rus, Devika Subramanian
In [13] we describe an architecture for building special-purpose agents for solving the information capture and access problem in large, multi-media data environments with uncertainty in data...
Selective Use of Full-Text Databases (1992)
Salton, Gerard, Allan, James, Buckley, Chris
Large files of natural-language text are now available for automatic processing in machine readable form. Such text files may include documents of textbook size, medium-size newspaper articles and...
Automatic Structuring and Retrieval of Large Text Files (1992)
Salton, Gerard, Allan, James, Buckley, Chris
In many operational environments, large text files must be processed covering a wide variety of different topic areas. Aids must then be provided to the user that permit collection browsing and make...
Automatic Structuring of Text Files (1992)
Gerard Salton, Chris Buckley, James Allan
In many practical information retrieval situations, it is necessary to process heterogeneous text databases that vary greatly in scope and coverage, and deal with many different subjects. In such an...
Automatic Structuring of Text Files (1991)
Salton, Gerard, Buckley, Chris, Allan, James
In many practical information retrieval situations, it is necessary to process heterogeneous text databases that vary greatly in scope and coverage, and deal with many different subjects. In such an...
Buckle, Robin, Balmer, Michael, Yenidunya, Ali, Allan, James
Core histone octamers reconstituted in vitro onto DNA fragments containing the chicken β-globin gene promoter are precisely positioned with respect to the underlying DNA sequence [1]. Here we show...
The Linker Histone Stoichiometry of Chicken Erythrocyte Chromatin in situ (1988)
Cattini, Peter A., Harborne, Nerina, Allan, James
Isolated chicken erythrocyte chromatins (native, linker histone-depleted and H5-reconstitutes) and swollen chicken erythrocyte nuclei, have been embedded, sectioned, and compared in the electron...
Precise nucleosome positioning in the promoter of the chicken βA globin gene (1988)
Kefalas, Panagiotis, Gray, Fiona C., Allan, James
Histone octamers were reconstituted onto 5' end-labelled DNA fragments derived from the promoter region of the chicken βA globin gene. The location of the reconstituted histone octamer with respect...
Natural and artificial diagenesis of coal macerals [microform] (1975)
Thesis (doctoral)--University of Newcastle upon Tyne, 1975.