4 Feature Selection and Document Clustering (2009)
Inderjit Dhillon, Jacob Kogan, Charles Nicholas
Feature selection is a basic step in the construction of a vector space or bag of words model [BB99]. In particular, when the processing task is to partition a given document collection into clusters...
CARROT II and the TREC 11 Web Track (2008)
R. Scott Cost, Srikanth Kallurkar, Hemali Majithia, Charles Nicholas
Abstract. We describe CARROT II, an agent-based architecture for distributed information retrieval and document collection management. CARROT II consists of an arbitrary number of agents, distributed...
ITTALKS: A Case Study in the SemanticWeb and DAML (2008)
R. Scott Cost, Tim Finin, Anupam Joshi, Yun Peng, Charles Nicholas, Ian Soboroff, ...
Techniques for Gigabyte-Scale N-gram Based (2008)
Information Retrieval On, Ethan Miller, Dan Shen, Junli Liu, Charles Nicholas, Ting Chen
this paper, we discuss the implementation techniques that allowed us to use n-gram based retrieval methods on a gigabyte corpus on commodity personal computer hardware. While such techniques have...
Techniques for Gigabyte-Scale N-gram Based Information Retrieval on Personal Computers (2007)
Ethan Miller Dan, Dan Shen, Junli Liu, Charles Nicholas, Ting Chen
this paper, we discuss the implementation techniques that allowed us to use n-gram based retrieval methods on a gigabyte corpus on commodity personal computer hardware. While such techniques have...
Category Theory as a Foundation for Document Processing (2007)
Charles Nicholas, James Mayfield, James Sasaki
Documents, particularly electronic documents that are created, disseminated, and used with computers, have several representations. Users may wish to work with such electronic documents in any of a...
ITtalks: A Case Study in the Semantic Web (2007)
R. Scott Cost, Tim Finin, Anupam Joshi, Yun Peng, Charles Nicholas, Ian Soboroff, ...
languages will improve the automated gathering and processing of information and help integrate multiagent systems with the existing information infrastructure. In this article, the authors describe...
Agents making sense of the Semantic Web (2007)
Lalana Kagal, Filip Perich, Harry Chen, Sovrin Tolia, Youyong Zou, Tim Finin, ...
Abstract. Effective use of the vast quantity of available information and services on the Internet will require multi-agent systems to be tightly integrated with existing web infrastructure. This...
Srikanth Kallurkar, Yongmei Shi, R. Scott Cost, Charles Nicholas, Christopher James, Sowjanya Rajavaram, ...
Abstract. We present the results of UMBC’s participation in the Web and Novelty tracks. We explored various heuristics-based link analysis approaches to the Topic Distillation task. For the novelty...
Ittalks: A case study in the semantic web and daml (2002)
R. Scott Cost, Tim Finin, Anupam Joshi, Yun Peng, Charles Nicholas, Harry Chen, ...
Effective use of the vast quantity of information now available on the web will require the use of “semantic web” markup languages such as the DARPA Agents Markup Language (DAML). Such languages...
Ittalks: A case study in the semantic web and daml (2002)
R. Scott Cost, Tim Finin, Anupam Joshi, Yun Peng, Charles Nicholas, Ian Soboroff, ...
Abstract. Effective use of the vast quantity of information now available on the web will require the use of “Semantic Web ” markup languages such as the DARPA Agent Markup Language (DAML). Such...
Ittalks: A case study in the semantic web and daml (2002)
R. Scott Cost, Tim Finin, Anupam Joshi, Yun Peng, Charles Nicholas, Ian Soboroff, ...
Abstract. Effective use of the vast quantity of information now available on the web will require the use of “Semantic Web ” markup languages such as the DARPA Agent Markup Language (DAML). Such...
Ittalks: A case study in the semantic web and daml (2002)
R. Scott Cost, Tim Finin, Anupam Joshi, Yun Peng, Charles Nicholas, Ian Soboroff, ...
Abstract. Effective use of the vast quantity of information now available on the web will require the use of “Semantic Web ” markup languages such as the DARPA Agent Markup Language (DAML). Such...
Integrating distributed information sources with CARROT II (2002)
R. Scott Cost, Srikanth Kallurkar, Hemali Majithia, Charles Nicholas, Yongmei Shi I
Abstract. We describe CARROT II (C2), an agent-based architecture for distributed information retrieval and document collection management. C2 can consist of an arbitrary number of agents,...
C.: SHOMAR: An Open Architecture for Distributed Intrusion Detection Services (2002)
Jeffrey Undercoffer, Filip Perich, Charles Nicholas
Distributed Intrusion Detection Systems (DIDS) offer an alternative to centralized intrusion detection. Current research indicates that a distributed intrusion detection paradigm may afford greater...
Using latent semantic analysis to find different names for the same entity in free text (2002)
Tim Oates, Vinay Bhat, Vishal Shanbhag, Charles Nicholas
A common problem faced when gathering information from the web is the use of different names to refer to the same entity. For example, the city in India referred to as Bombay in some documents may be...
Ittalks: A case study in the semantic web and daml (2002)
R. Scott Cost, Tim Finin, Anupam Joshi, Yun Peng, Charles Nicholas, Ian Soboroff, ...
Abstract. Effective use of the vast quantity of information now available on the web will require the use of “Semantic Web ” markup languages such as the DARPA Agent Markup Language (DAML). Such...
Ittalks: A case study in the semantic web and daml (2002)
R. Scott Cost, Tim Finin, Anupam Joshi, Yun Peng, Charles Nicholas, Ian Soboroff, ...
Abstract. Effective use of the vast quantity of information now available on the web will require the use of "Semantic Web " markup languages such as the DARPA Agent Markup Language...
Travis Atkison, Kathleen Pensy, Charles Nicholas, David Ebert, Rebekah Atkison, Chris Morris
Abstract. We describe our efforts to analyze network intrusion detection data using information retrieval and visualization tools. By regarding Telnet sessions as documents, which may or may not...
ITTALKS: An application of agents in the semantic web (2001)
Filip Perich, Lalana Kagal, Harry Chen, Sovrin Tolia, Youyong Zou, Tim Finin, ...
Abstract. Effective use of the vast quantity of information now available on the web will require the use of “Semantic Web ” markup languages such as the DARPA Agent Markup Language (DAML). Such...
Performance and Scalability of a Large-scale N-gram Based Information Retrieval System (2000)
Ethan Miller, Dan Shen, Junli Liu, Charles Nicholas
Information retrieval has become more and more important due to the rapid growth of all kinds of information. However, there are few suitable systems available. This paper presents a few approaches...
Performance and Scalability of a Large-Scale N-gram Based Information Retrieval System (2000)
Ethan Miller, Dan Shen, Junli Liu, Charles Nicholas
Information retrieval has become more and more important due to the rapid growth of all kinds of information. However, there are few suitable systems available. This paper presents a few approaches...
Performance and Scalability of a Large-Scale N-gram Based Information Retrieval System (2000)
Ethan Miller, Dan Shen, Junli Liu, Charles Nicholas
Information retrieval has become more and more important due to the rapid growth of all kinds of information. However, there are few suitable systems available. This paper presents a few approaches...
Spotting Topics with the Singular Value Decomposition (1998)
Charles Nicholas, Randall Dahlberg
. The singular value decomposition, or SVD , has been studied in the past as a tool for detecting and understanding patterns in a collection of documents. We show how the matrices produced by the SVD...
Spotting Topics with the Singular Value Decomposition (1998)
Charles Nicholas, Randall Dahlberg
. The singular value decomposition, or SVD , has been studied in the past as a tool for detecting and understanding patterns in a collection of documents. We show how the matrices produced by the SVD...
Spotting topics with the singular value decomposition (1998)
Charles Nicholas, All Dahlberg
Abstract. The singular value decomposition, or SVD, has been studied in the past as a tool for detecting and understanding patterns in a collection of documents. We show how the matrices produced by...
TKQML: A Scripting Tool for Building Agents (1997)
R. Scott Cost, Ian Soboro, Jeegar Lakhani, Tim Finin, Ethan Miller, Charles Nicholas
Abstract. Tcl/Tk is an attractive language for the design of intelligent agents because it allows the quick construction of prototypes and user interfaces; new scripts can easily be bound at runtime...
Agent Development Support for Tcl (1997)
R. Scott Cost, Ian Soboro, Jeegar Lakhani, Tim Finin, Ethan Miller, Charles Nicholas
In the past few years, the explosive growth of the Internet has allowed the construction of "virtual" systems containing hundreds or thousands of individual, relatively inexpensive...
Agent Development Support for Tcl (1997)
R. Scott Cost, Tim Finin, Jeegar Lakhani, Ethan Miller, Charles Nicholas, Ian Soboroff
Tcl/Tk is an attractive language for the design of intelligent agents because it allows the quick construction of prototypes and user interfaces; new scripts can easily be bound at runtime to respond...
Agent Development Support for Tcl (1997)
R. Scott Cost, R. Scott Cost, Ian Soboroff, Ian Soboroff, Jeegar Lakhani, Jeegar Lakhani, ...
In the past few years, the explosive growth of the Internet has allowed the construction of "virtual" systems containing hundreds or thousands of individual, relatively inexpensive...
Agent Development Support for Tcl (1997)
R. Scott Cost, Ian Soboro, Jeegar Lakhani, Tim Finin, Ethan Miller, Charles Nicholas
lications into a distributed framework, using KQML as a communication language. Tcl's embeddable nature allows one to easily add agent communication facilities to existing code. As such, TKQML...
TKQML: A KQML Extension to Tcl (1997)
R. Scott Cost, Ian Soboroff, Jeegar Lakhani, Tim Finin, Ethan Miller, Charles Nicholas
Tcl/Tk is an attractive language for the design of intelligent agents because it allows the quick construction of prototypes and user interfaces; new scripts can easily be bound at runtime to respond...
TKQML: A Scripting Tool for Building Agents (1997)
R. Scott Cost, Ian Soboro, Jeegar Lakhani, Tim Finin, Ethan Miller, Charles Nicholas
. Tcl#Tk is an attractive language for the design of intelligent agents because it allows the quick construction of prototypes and user interfaces; new scripts can easily be bound at runtime to...
Agent Development Support for Tcl (1997)
Scott Cost, Ian Soboroff, Jeegar Lakhani, Tim Finin, Ethan Miller, Charles Nicholas
pplications into a distributed framework, using KQML as a communication language. Tcl's embeddable nature allows one to easily add agent communication facilities to existing code. As such, TKQML...
TKQML: A Scripting Tool for Building Agents (1997)
Scott Cost, Ian Soboroff, Jeegar Lakhani, Tim Finin, Ethan Miller, Charles Nicholas
Tcl/Tk is an attractive language for the design of intelligent agents because it allows the quick construction of prototypes and user interfaces; new scripts can easily be bound at runtime to respond...
Meta-data for Distributed Text Retrieval (1997)
Grace Crowder, Charles Nicholas
duced by every back-end for use by the broker. The first back-end information provider to be adapted for use in CARROT was Telltale [5], which is an n-gram based information retrieval system. We have...
Agent Development Support for Tcl (1997)
Scott Cost, Tim Finin, Jeegar Lakhani, Ethan Miller, Charles Nicholas, Ian Soboeroff
Tcl/Tk is an attractive language for the design of intelligent agents because it allows the quick construction of prototypes and user interfaces; new scripts can easily be bound at runtime to respond...
TKQML: A Scripting Tool for Building Agents (1997)
Scott Cost, Ian Soboroff, Jeegar Lakhani, Tim Finin, Ethan Miller, Charles Nicholas
Tcl/Tk is an attractive language for the design of intelligent agents because it allows the quick construction of prototypes and user interfaces; new scripts can easily be bound at runtime to respond...
Meta-data for Distributed Text Retrieval (1997)
Grace Crowder, Charles Nicholas
duced by every back-end for use by the broker. The first back-end information provider to be adapted for use in CARROT was Telltale [5], which is an n-gram based information retrieval system. We have...
Agent Development Support for Tcl (1997)
R. Scott Cost, Ian Soboroff, Jeegar Lakhani, Tim Finin, Ethan Miller, Charles Nicholas, ...
In the past few years, the explosive growth of the Internet has allowed the construction of "virtual" systems containing hundreds or thousands of individual, relatively inexpensive...
Resource Selection in CAFE: an Architecture for Network Information Retrieval (1996)
Grace Crowder, Charles Nicholas
Introduction The Intelligence Community is faced with the daunting task of extracting useful information from an ever-increasing supply of raw data. This raw data is captured in a variety of formats,...
Using Statistical Properties of Text to Create Metadata (1996)
Grace Crowder, Charles Nicholas
able - can create metadata of metadata ffl Interchangeable - the form must allow queries, documents, and metadata to be compared. 3 CAFE We call our mediated architecture CAFE for Cooperating Agents...
Grace Crowder, Charles Nicholas
Introduction and Problem Statement Given a large, dynamic corpus, distributed over several (possibly hundreds or thousands of ) physical sites traditional Information Retrieval techniques don't...
Snitch: Augmenting Hypertext Documents With A Semantic Net (1993)
James Mayfield, Charles Nicholas
A new model of hypertext, in which text is augmented with a fine-grained semantic net representation of the text, solves several problems found in traditional hypertext models. In the new model,...
Snitch: Augmenting Hypertext Documents With A Semantic Net (1993)
James Mayfield, Charles Nicholas
A new model of hypertext, in which text is augmented with a fine-grained semantic net representation of the text, solves several problems found in traditional hypertext models. In the new model,...
Grace Crowder Crowder, Charles Nicholas
Introduction and Problem Statement Given a large, dynamic corpus, distributed over several (possibly hundreds or thousands of ) physical sites traditional Information Retrieval techniques don't...
Grace Crowder, Charles Nicholas
Introduction and Problem Statement Given a large, dynamic corpus, distributed over several (possibly hundreds or thousands of ) physical sites traditional Information Retrieval techniques don't...