Combining Semantics, Context, and Statistical Evidence in Genomics Literature Search (2009)
Jay Urbain, Nazli Goharian, Ophir Frieder
Abstract—We present an information retrieval model for combining evidence from concept-based semantics, term statistics, and context for improving search precision of genomics literature by...
Temporal Analysis of a Very Large Topically Categorized Web Query Log (2009)
Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, Ophir Frieder, David Grossman
The authors review a log of billions of Web queries that constituted the total query traffic for a 6-month period of a general-purpose commercial Web search service. Previously, query logs were...
HAPI: Hardware Assisted Pruned Index for Consumer Electronics (2009)
Pruned Index (HAPI) component. HAPI is a content indexing device based on a modified inverted index structure. HAPI accommodates patterns of different lengths and supports a varied posting list...
Yue Yu, Shangping Ren, Ophir Frieder
We apply interval-based timing constraint satisfaction probability results to predict timing constraint violations in real-time embedded system with a known hardware transient failure model. A...
important problem in both wireless and wired communication networks is to be able to efficiently multicast information to a group of network sites. Multicasting reduces the transmission overhead of...
Efficient Residential Consumer Device Interaction With Network Services (2009)
Eric William Burger, Senior Member, Ophir Frieder
Abstract — Consumer access networks are low bandwidth, especially in the up-stream direction. This asymmetry is heightened for real-time multimedia applications. The web wait is not acceptable when...
A Novel System for Remote Control of Household Devices Using Digital IP Phones (2009)
Eric William Burger, Senior Member, Ophir Frieder
Abstract — The idea of using a phone as a remote control for household devices is not new. However, new digital technologies such as Voice-over-IP and signaling protocols such as SIP enable new...
CHRISTOPHEK STATON, American Management Systems (2009)
Bret Bailey Sybuse, Ophir Frieder
The foLlowing case.rt&y shows how a simple p;r-otoqpe can be used to ve7*zfi, before production, that a system will pe$om at an acceptable he1 under realistic conditions.,s.&.-.. I _ _.,,,...
Ensemble LUT classification for degraded document enhancement (2009)
Tayo Obafemi-ajayi, Gady Agam, Ophir Frieder
The fast evolution of scanning and computing technologies have led to the creation of large collections of scanned paper documents. Examples of such collections include historical collections, legal...
Query Workload Driven Summarization for P2P Query Routing (2009)
Linh Thai Nguyen, Wai Gen Yee, Ophir Frieder
Query routing in peer-to-peer systems is based on the peers ’ content summaries. For the sake of scalability, summaries are built at a “peer level. ” The coarseness of peer level summaries...
Spam Characterization and Detection in Peer-to-Peer File- Sharing Systems (2009)
Dongmei Jia, Wai Gen Yee, Ophir Frieder
Spam is highly pervasive in P2P file-sharing systems and is difficult to detect automatically before actually downloading a file due to the insufficient and biased description of a file returned to a...
Adaptive Distributed Indexing for Structured Peer-to-Peer Networks (2009)
Linh Thai Nguyen, Wai Gen Yee, Ophir Frieder
Structured peer-to-peer networks support keyword search by building a distributed index over the collective content shared by all peers. Building the index and processing queries involve data...
Passage relevance models for genomics search (2009)
Urbain, Jay, Frieder, Ophir, Goharian, Nazli
Abstract We present a passage relevance model for integrating syntactic and semantic evidence of biomedical concepts and topics using a probabilistic graphical model. Component models of topics,...
The Object Behavior of Java Object-Oriented Database Management Systems (2009)
Morris Chang, Ophir Frieder, David Grossman
Due to its portability and popularity for Internet applications, Java has become one of the major programming languages. The similar syntax inherited from the C language and the pure object...
On Mediated Search of the United States Holocaust Memorial Museum Data (2008)
Ophir Frieder, David Grossman, Larry Kane
recently celebrated its ten-year anniversary. The museum was established to bear witness to the human atrocities committed by the Nazi reign of terror. As such, related data must be collected, and...
Wai Gen Yee, Ophir Frieder, Dongmei Jia
Data crawler collects information from Gnutella networks and stores it in MySQL database. The information collected by IR-Wire includes: • User queries • Peer information • Peer content...
Novel Applications of Information Retrieval Techniques to Peer-to-Peer File-Sharing Systems (2008)
Wai Gen Yee, Linh Thai Nguyen, Ophir Frieder
A leading application of peer-to-peer technology is file sharing. Because of its scale – in the millions of users daily – it is important that file-sharing systems have good search capabilities....
David A. Grossman, Ophir Frieder, David O. Holmes
In this our rst year of TREC participation, we implemented an IR system using an AT&T DBC-1012 Model 4 parallel relational database machine. We started with the premise that a relational system...
Dongmei Jia, Wai Gen Yee, Ophir Frieder
Peer-to-peer file-sharing systems have poor search performance for rare or poorly described files. These files lack a quality or variety of metadata making them hard to match with queries. A server...
Distributed, Automatic File Description Tuning in Peer-to-Peer File-Sharing Systems (2008)
Dongmei Jia, Wai Gen, Yee Linh, Thai Nguyen, Ophir Frieder
Peers in peer-to-peer file-sharing systems cannot effectively share their files if they are poorly described. Terms one user employs to describe an instance of a file may not be those that are...
Abstract Sparse Power Efficient Topology for Wireless Networks (2008)
Xiang-yang Li, Peng-jun Wan, Yu Wang, Ophir Frieder
We consider how to construct power efficient wireless ad hoc networks. We propose two different methods combin-ing several well-known proximity graphs including Gabriel graph and Yao graph, which can...
ABSTRACT OVSF-CDMA Code Assignment in Wireless Ad Hoc Networks ∗ (2008)
Orthogonal Variable Spreading Factor (OVSF) CDMA code provides a means of support of variable rate data service at low hardware cost. In contrast to the conventional orthogonal fixed-spreading-factor...
Localized Topology Control for Unicast and Broadcast in Wireless Ad Hoc Networks (2008)
Wen-zhan Song, Xiang-yang Li, Ophir Frieder, Weizhao Wang
Abstract — We propose a novel localized topology control algorithm for each wireless node to locally select communication neighbors and adjust its transmission power accordingly, such that all...
Dongmei Jia, Wai Gen Yee, Ophir Frieder
Peer-to-peer file-sharing systems have poor search performance for rare or poorly described files. These files lack a quality or variety of metadata making them hard to match with queries. A server...
Content Analysis of Text using Nearest Neighbor (2008)
Sobhan Raj, Hota Guided, Ophir Frieder
It has never been assumed that an IRS should attempt to understand the content of a document. Most IRS at this moment, aim at a bibliographic (www.dictionary.com: A list of the works of a specific...
What is a Duplicate Document? (2008)
Ophir Frieder, Ophir Frieder, Ophir Frieder
� Ingest data from multiple sources – Duplicate document detection � Process multiple type data sources – Structured & unstructured data integration (SIRE) � Use scalable (parallel)...
IIT at TREC-2003 (DRAFT) Task Classification & Document Structure for Known-Item Search (2008)
Steve Beitzel, Eric Jensen, David Grossman, Ophir Frieder, Abdur Chowdhury, Greg Pass, ...
This year’s TREC 2003 web task incorporated two retrieval tasks into a single set of experiments for Known-Item retrieval. We hypothesized that not all retrieval tasks should use the same retrieval...
Exploiting Parallelism to Support Scalable Hierarchical Clustering ∗ (2008)
Rebecca Cathey, Eric C. Jensen, Steven M. Beitzel, Ophir Frieder, David Grossman
A distributed memory parallel version of the group average Hierarchical Agglomerative Clustering algorithm is proposed to enable scaling the document clustering problem to large collections. Using...
Rationale- Research Component (2008)
Jay Urbain, Nazli Goharian, Ophir Frieder, Query Processing
Passage retrieval model for combining concept-based semantics and term statistics in context. Objective: Passage retrieval precision Methods: • Datawarehouse style indexing for efficient search and...
On Mediated Search of the United States Holocaust Memorial Museum Data (2008)
Ophir Frieder, David Grossman, Larry Kane
recently celebrated its ten-year anniversary. The museum was established to bear witness to the human atrocities committed by the Nazi reign of terror. As such, related data must be collected, and...
IIT at TREC-2003 Task Classification & Document Structure for Known-Item Search (2008)
Steve Beitzel, Eric Jensen, Rebecca Cathey, Ling Ma, David Grossman, Ophir Frieder, ...
This year’s TREC 2003 web task incorporated two retrieval tasks into a single set of experiments for Known-Item retrieval. We hypothesized that not all retrieval tasks should use the same retrieval...
Abstract IIT at TREC-8: Improving Baseline Precision (2008)
M. Catherine Mccabe, Abdur Chowdhury, David O. Holmes, David A. Grossman, Kenneth L. Alford, Ophir Frieder
In TREC-8, we participated in the automatic and manual tracks for category A as well as the small web track. This year, we focussed on improving our baseline and then introduced some experimental...
Channel Alternation And Rotation For Trisectorized Cellular Systems (2008)
Vincent A. Nguyen, Peng-jun Wan, Ophir Frieder
Abstract- Conventional trisectored cellular systems have not taken full advantages of antenna directivities to enhance frequency reuse efficiency. A novel Channel Alternation and Rotation (CAR)...
C.4 [Computer Systems Organizationl: Performance of Systems-dcaign studks, (2008)
We propose a document-searching architecture baaed on high-speed hardware pattern matching to increase the throughput of an information retrieval system. We also propose B new parallel VLSI...
Query Phrase Suggestion from Topically Tagged Session Logs (2008)
Eric C. Jensen, Steven M. Beitzel, Abdur Chowdhury, Ophir Frieder
Abstract. Searchers ’ difficulty in formulating effective queries for their information needs is well known. Analysis of search session logs shows that users often pose short, vague queries and...
Information Retrieval and Visualization using SENTINEL (2008)
Margaret Knepper Robert, Robert Killam, Kevin L. Fox, Ophir Frieder
INTRODUCTION Harris Corporation focuses on information retrieval support for various Government agencies. Time constraints and interest-level limit our user to reviewing the top documents before...
IIT at TREC-8: Improving Baseline Precision (2008)
Catherine Mccabe Advanced, M. Catherine Mccabe, Abdur Chowdhury, David O. Holmes, David A. Grossman, Kenneth L. Alford, ...
In TREC-8, we participated in the automatic and manual tracks for category A as well as the small web track. This year, we focussed on improving our baseline and then introduced some experimental...
Efficient Query Routing by Improved Peer Description (2008)
Wai Gen Yee, Linh Thai Nguyen, Dongmei Jia, Ophir Frieder
Peer-to-peer file-sharing systems commonly use the set-of-terms model to describe succinctly a peer’s shared file set: the union of the terms in the share files. This information is used to guide...
in Biological Sequence Analysis (2007)
Tieng K. Yap, Ophir Frieder, Senior Member, Robert L. Martino
Abstract---A massive volume of biological sequence data is available in over 36 different databases worldwide, including the sequence data generated by the Human Genome project. These databases,...
The Frontiers of Massively Parallel Computation (2007)
Tieng K. Yap, Tieng K. Yap, Ophir Frieder, Ophir Frieder, Robert L. Martino, Robert L. Martino
We present a parallel computational method for retrieving similar sequences from large genetic and protein databases using a dynamic programming comparison algorithm. Two previously published...
Unnoticeable Jitter in ATM Workstation Configurations 1 (2007)
Abdur Chowdhury, Ophir Frieder, Manish Karir, Spyro Papademetriou
Abstract. We evaluate the variation of inter-cell arrival times, namely jitter, as seen by a workstation. We present experimental results of actual jitter measurements on the FORE ASX200BX switch and...
1 Using Relevance Feedback within the Relational Model for TREC-5 (2007)
David A. Grossman, Carol Lundquist, John Reichart, David Holmes, Abdur Chowdhury, Ophir Frieder
For TREC-5, we enhanced our existing prototype that implements relevance ranking using the AT&T DBC-1012 Model 4 parallel database machine to include relevance feedback. We identified SQL to...
M. Catherine Mccabe, Jinho Lee, Abdur Chowdhury, David Grossman, Ophir Frieder
{jinho, abdur, dagr, ophir} @ ir.iit.edu We present a method of searching text collections that takes advantage of the inherent hierarchrical information within documents and integrates searches of...
M. Catherine Mccabe, Abdur Chowdhury, David O. Holmes, David A. Grossman, Kenneth L. Alford, Ophir Frieder
In TREC-8, we participated in the automatic and manual tracks for category A as well as the small web track. This year, we first ensured that our baseline matched the effectiveness achieved by other...
Ophir Frieder, Ophir Frieder, Ophir Frieder, Ophir Frieder
Goals o Integrate structured and semi-structured data using a framework that can also integrate unstructured data. o Improve accuracy of retrieved results o Support scalability: data volume ...
Unnoticeable Jitter in ATM Workstation Configurations (2007)
Abdur Chowdhury And, Abdur Chowdhury, Ophir Frieder, Manish Karir, Spyro Papademetriou
. We evaluate the variation of inter-cell arrival times, namely jitter, as seen by a workstation. We present experimental results of actual jitter measurements on the FORE ASX200BX switch and a...
On the Implementation of a Relational Scalable Information Retrieval Engine (2007)
M. Catherine Mccabe, David Grossman, Ophir Frieder, Abdur Chowdhury
INTRODUCTION We present a Scalable Information Retrieval Engine, called SIRE, implemented strictly as a relational database application. We illustrate how key information retrieval techniques such as...
Large Scale Information Processing (2007)
This paper is a revision of the IEEE IC3N-98 paper, one of the papers selected from the conference to appear in a special issue dedicated to the conference.)
A Parallel RDBMS Approach to Implement Relevance Feedback (Extended Abstract) (2007)
this paper, the terms in the relevant documents were sorted using six different sort orders. The sorts were done using either were done using either
High Performance DBMS and Information Retrieval Systems (2007)
David Grossman, Ophir Frieder, Sanjeev Setia
Introduction This will review some current industry solutions for high performance Database Management Systems (DBMS) and Information Retrieval (IR) solutions. Although numerous academic prototypes...
Ophir Frieder, Senior Member, Hava T. Siegelmann
Abstract—We formally define the Multiprocessor Document Allocation Problem (MDAP) and prove it to be computationally intractable (NP Complete). Once it is shown that MDAP is NP Complete, we...
xploiting parallelism in database processing has been a research goal since the early 1970s. Originally, special-purpose architectures were developed to provide the computational and input/ output...
1 Introduction A Radio Coloring of a Hypercube (2007)
Ophir Frieder, Frank Harary, Peng-jun Wan
A radio coloring of a graph G is an assignment of nonnegative
ABSTRACT Parallelizing the Buckshot Algorithm for Efficient Document Clustering ∗ (2007)
Eric C. Jensen, Steven M. Beitzel, Angelo J. Pilotto, Nazli Goharian, Ophir Frieder
We present a parallel implementation of the Buckshot document clustering algorithm. We demonstrate that this parallel approach is highly efficient both in terms of load balancing and minimization of...
Ophir Frieder, Abdur Chowdhury, David A. Grossman, Gideon Frieder
We review a variety of techniques to improve efficiency in information retrieval. Given the increasing volumes of data that are available electronically, understanding and using such techniques is...
General Conference (Part Grooming of `Arbitrary Traffic in SONET/WDM Rings (2007)
Jun Wan, Liwu Liu, Ophir Frieder
Abstract- add-drop multiplexers are the dominant cost in SONET/WDM rings. They can potentially be reduced by optical bypass via-wavelength drop multiplexers and grooming. While many works have been...
in ti! e Medical Arena Archiving and Communications (2007)
Exploitin Databasetechnology, Ophir Frieder
Systems (PACS) are the back-bone of the totally digital radiology department. They supply the communications, storage, and user-interface. PACS attempts to utilize computing power to increase the...
Ophir Frieder, Chaitanya K. Baru
Query scheduling and site selection algorithms for read-only quenes on a cube-connected multicomputer are presented. The assumed global system architecture was initially presented in [Fri87]. An...
Abstract Optimal Placement of Wavelength Converters in Trees and Trees of Rings* (2007)
Peng-jun Wan, Liwu Liu, Ophir Frieder
In wavelength routed optical networks, wavelength converters can potentially reduce the requirement on the number of wavelengths. The problem of placing a minimum number of wavelength converters in a...
Abdur Chowdhury, Mohammed Aljlayl, Eric Jensen, Steve Beitzel, David Grossman, Ophir Frieder
For TREC 10 we participated in the Named Page Finding Task and the Cross-Lingual Task. In the web track, we explored the use of linear combinations of term collections based on document structure....
PARALLEL MULTIPLE SEQUENCE ALIGNMENT (2007)
Ophir Frieder, Robert L. Martino, Tieng K. Yap, Tieng K. Yap, Peter J. Munson, Peter J. Munson
Abstract-- Many different methods have been presented for aligning multiple biological sequences. These methods can be classified into three categories: rigorous, tree-based, and iterative. The...
Localized Routing for Wireless Ad Hoc Networks (2007)
Xiang-yang Li, Yu Wang, Ophir Frieder
We show that, given a set of randomly distributed wireless nodes with density n, when the transmission range r_n of wireless nodes satisfies #r log n+c(n) n , the localized Delaunay triangulation...
Steven M. Beitzel, Eric C. Jensen, David Grossman, Nazli Goharian, Ophir Frieder
Prior efforts have shown that data fusion techniques can be used to improve retrieval effectiveness under certain situations. Although the precise conditions necessary for fusion to improve retrieval...
Domenico Laforenza, Ophir Frieder, Marco Vanneschi, Gerhard Weikum
ma chi parte mona, torna mona.
A Tool for Information Retrieval Research in Peer-to-Peer File Sharing Systems (2007)
Linh Thai Nguyen, Wai Gen Yee, Dongmei Jia, Ophir Frieder
We introduce IR-Wire, a tool for information retrieval research and education in peer-to-peer file-sharing systems. Built on top of LimeWire’s implementation of the popular Gnutella standard, it...
1 The World Around Us • All kinds of devices – everywhere! (2007)
Ophir Frieder, David Grossman, Information Retrieval, Courtesy Dr, Xiang-yang Li
– Xiang-Yang Li, cryptography, network optimization algorithms – Peng-Jun Wan, network optimization algorithms – Wai Gen Yee, peer-to-peer, distributed systems • Students: – 2 – 3...
Using a Relational Database for Scalable XML Search (2007)
Rebecca J. Cathey, Steven M. Beitzel, Eric C. Jensen, David Grossman, Ophir Frieder
XML is a flexible and powerful tool that enables information and security sharing in heterogeneous environments. Scalable technologies are needed to effectively manage the growing volumes of XML...
Relationally Mapping XML Queries for Scalable XML Search (2007)
Rebecca J. Cathey, Steven M. Beitzel, Eric C. Jensen, David Grossman, Ophir Frieder
Abstract—The growing trend of using XML to share security data requires scalable technology to effectively manage the volume and variety of data. Although a wide variety of methods exist for...
Efficient Interference-Aware TDMA Link Scheduling for Static Wireless Networks (2006)
Weizhao Wang, Yu Wang, Xiang-yang Li, Wen-zhan Song, Ophir Frieder
We study efficient link scheduling for a multihop wireless network to maximize its throughput. Efficient link scheduling can greatly reduce the interference effect of close-by transmissions. Unlike...
Wai Gen Yee, Linh Thai Nguyen, Ophir Frieder
Peers in peer-to-peer file-sharing systems use conjunctive queries as a way of controlling query cost in the absence of information about the behavior of other peers. Conjunctive queries are good at...
Automatic web query classification using labeled and unlabeled training data (2005)
Steven M. Beitzel, Eric C. Jensen, Ophir Frieder, David Grossman
Accurate topical categorization of user queries allows for increased effectiveness, efficiency, and revenue potential in general-purpose web search systems. Such categorization becomes critical if...
Predicting query difficulty on the web by learning visual clues (2005)
Eric C. Jensen, Steven M. Beitzel, David Grossman, Ophir Frieder
We describe a method for predicting query difficulty in a precision-oriented web search task. Our approach uses visual features from retrieved surrogate document representations (titles, snippets,...
Eric C. Jensen, Steven M. Beitzel, Ophir Frieder
We describe a framework of bootstrapped hypothesis testing for estimating the confidence in one web search engine outperforming another over any randomly sampled query set of a given size. To...
Surrogate scoring for improved metasearch precision (2005)
Steven M. Beitzel, Eric C. Jensen, Ophir Frieder
We describe a method for improving the precision of metasearch results based upon scoring the visual features of documents' surrogate representations. These surrogate scores are used during...
Regional Gossip Routing for Wireless Ad Hoc Networks (2005)
Xiang-yang Li, Kousha Moaveninejad, Ophir Frieder
Many routing protocols have been proposed for wireless ad hoc networks, and most of them are based on some variants of flooding. Thus many routing messages are propagated through the network...
Finding Rare Data Objects in P2P File-Sharing Systems (2005)
Wai Gen Yee, Dongmei Jia, Ophir Frieder
Peer-to-peer file-sharing systems have hundreds of thousands of users sharing petabytes of data, however, their search functionality is limited. In general, query results contain many references to...
Automatic web query classification using labeled and unlabeled training data (2005)
Steven M. Beitzel, Eric C. Jensen, Ophir Frieder, David Grossman
Accurate topical categorization of user queries allows for increased effectiveness, efficiency, and revenue potential in general-purpose web search systems. Such categorization becomes critical if...
Eric C. Jensen, Steven M. Beitzel, Ophir Frieder
We describe a framework of bootstrapped hypothesis testing for estimating the confidence in one web search engine outperforming another over any randomly sampled query set of a given size. To...
Improving automatic query classification via semi-supervised learning (2005)
Steven M. Beitzel, Eric C. Jensen, Ophir Frieder
Accurate topical classification of user queries allows for increased effectiveness and efficiency in general-purpose web search systems. Such classification becomes critical if the system is to...
Predicting query difficulty on the web by learning visual clues (2005)
Eric C. Jensen, Steven M. Beitzel, David Grossman, Ophir Frieder
We describe a method for predicting query difficulty in a precision-oriented web search task. Our approach uses visual features from retrieved surrogate document representations (titles, snippets,...
Surrogate scoring for improved metasearch precision (2005)
Steven M. Beitzel, Eric C. Jensen, Ophir Frieder
We describe a method for improving the precision of metasearch results based upon scoring the visual features of documents' surrogate representations. These surrogate scores are used during...
Eric C. Jensen, Steven M. Beitzel, Ophir Frieder
We describe a framework of bootstrapped hypothesis testing for estimating the confidence in one web search engine outperforming another over any randomly sampled query set of a given size. To...
Localized low weight graph and its applications in wireless ad hoc networks (2004)
Xiang-yang Li, Peng-jun Wan, Wen-zhan Song, Ophir Frieder
Abstract — We propose a new localized structure, namely, Incident MST and RNG Graph (IMRG), for topology control and broadcasting in wireless ad hoc networks. In the construction algorithm, each...
Localized algorithms for energy efficient topology in wireless ad hoc networks (2004)
Wen-zhan Song, Yu Wang, Xiang-yang Li, Ophir Frieder
Abstract. Topology control in wireless ad hoc networks is to select a subgraph of the communication graph (when all nodes use their maximum transmission range) with some properties for energy...
OVSF-CDMA code assignment for wireless ad hoc networks (2004)
Peng-Jun Wan, Xiang-Yang Li, Ophir Frieder
Orthogonal Variable Spreading Factor (OVSF) CDMA code consists of an infinite number of codewords with variable rates, in contrast to the conventional orthogonal fixed-spreadingfactor CDMA code....
Evaluation of filtering current news search results (2004)
Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, David Grossman, Ophir Frieder
We describe an evaluation of result set filtering techniques for providing ultra-high precision in the task of presenting related news for general web queries. In this task, the negative user...
Localized low weight graph and its applications in wireless ad hoc networks (2004)
Xiang-yang Li, Yu Wang, Peng-jun Wan, Wen-zhan Song, Ophir Frieder
Abstract — We propose a new localized structure, namely, Incident MST and RNG Graph (IMRG), for topology control and broadcasting in wireless ad hoc networks. In the construction algorithm, each...
Migrating Information Retrieval from the Graduate to the Undergraduate Curriculum (2004)
Nazli Goharian, David Grossman, Ophir Frieder, Nambury Raju
this document is a little bit relevant" that may well serve to improve our understanding of existing effectiveness tools. It should be noted that asking students to invent something is appealing...
Evaluation of filtering current news search results (2004)
Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, David Grossman, Ophir Frieder
We describe an evaluation of result set filtering techniques for providing ultra-high precision in the task of presenting related news for general web queries. In this task, the negative user...
The design of pirs, a peer-to-peer information retrieval system (2004)
Abstract. We describe the design of PIRS, a peer-to-peer information retrieval system. Unlike some other proposed approaches, PIRS does not require the centralization of data onto specially...
Hourly analysis of a very large topically categorized web query log (2004)
Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, David Grossman, Ophir Frieder
We review a query log of hundreds of millions of queries that constitute the total query traffic for an entire week of a generalpurpose commercial web search service. Previously, query logs have been...
On fusion of effective retrieval strategies in the same information retrieval system (2004)
Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, David Grossman, Ophir Frieder, Nazli Goharian
Prior efforts have shown that under certain situations, retrieval effectiveness may be improved via the use of data fusion techniques. Although these improvements have been observed from the fusion...
k-Anycast Game in Selfish Networks (2004)
Weizhao Wang, Xiang-yang Li, Ophir Frieder
Abstract — Conventionally, many network routing protocols assumed that every network node will forward data packets for other nodes without any deviation. However, this may not be true when nodes...
Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, Ophir Frieder, David Grossman, Nazli Goharian
Many prior efforts have been devoted to the basic idea that data fusion techniques can improve retrieval effectiveness. Recent work in the area suggests that many approaches, particularly...
Chih-wei Yi, Peng-jun Wan, Xiang-yang Li, Ophir Frieder
Abstract—Nodes in wireless ad hoc networks may become inactive or unavailable due to, for example, internal breakdown or being in the sleeping state. The inactive nodes cannot take part in...
Geometric spanners for wireless ad hoc networks (2003)
Khaled Alzoubi, Xiang-yang Li, Yu Wang, Peng-jun Wan, Ophir Frieder
Abstract—We propose a new geometric spanner for static wireless ad hoc networks, which can be constructed efficiently in a localized manner. It integrates the connected dominating set and the local...
Using Manually-built Web Directories for Automatic Evaluation of Known-Item Retrieval (2003)
Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, David Grossman, Ophir Frieder
Information retrieval system evaluation is complicated by the need for manually assessed relevance judgments. Large manually-built directories on the web open the door to new evaluation procedures....
On scalable information retrieval systems (2003)
Scalable information retrieval systems are crucial to meeting the growing volumes of data. We describe work done to facilitate scalability by reducing duplication, providing integration with...
Using Manually-built Web Directories for Automatic Evaluation of Known-Item Retrieval (2003)
Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, David Grossman, Ophir Frieder
Information retrieval system evaluation is complicated by the need for manually assessed relevance judgments. Large manually-built directories on the web open the door to new evaluation procedures....
On scalable information retrieval systems (2003)
Scalable information retrieval systems are crucial to meeting the growing volumes of data. We describe work done to facilitate scalability by reducing duplication, providing integration with...
Geometric spanners for wireless ad hoc networks (2003)
Khaled Alzoubi, Xiang-yang Li, Yu Wang, Peng-jun Wan, Ophir Frieder
Abstract—We propose a new geometric spanner for static wireless ad hoc networks, which can be constructed efficiently in a localized manner. It integrates the connected dominating set and the local...
Coverage in wireless ad-hoc sensor networks (2003)
Xiang-yang Li, Peng-jun Wan, Ophir Frieder
Abstract—Sensor networks pose a number of challenging conceptual and optimization problems such as location, deployment, and tracking [1]. One of the fundamental problems in sensor networks is the...
On the Development of Name Search Techniques for Arabic (2003)
Syed Uzair Aqeel, Steve Beitzel, Eric Jensen, David Grossman, Ophir Frieder
The need for effective identity matching systems has led to extensive research in the area of name search. For the most part, such work has been limited to English and other Latin-based languages....
Fault Tolerant Sensor Networks with Bernoulli Nodes (2003)
Chih-wei Yi, Peng-jun Wan, Xiang-yang Li, Ophir Frieder
Connectivity, power consuming and fault tolerance are three critical issues in sensor networks. In this paper, sensor networks are modeled by the unit disc graph, random point process and Bernoulli...
On the Development of Name Search Techniques for Arabic (2003)
Syed Uzair Aqeel, Steve Beitzel, Eric Jensen, Ophir Frieder, David Grossman
The need for effective identity matching systems has led to extensive research in the area of name search. For the most part, such work has been limited to English and other Latin-based languages....
New distributed algorithm for connected dominating set in wireless ad hoc networks (2002)
Khaled M. Alzoubi, Peng-jun Wan, Ophir Frieder
Abstract—Connected dominating set (CDS) has been proposed as virtual backbone or spine of wireless ad hoc networks. Three distributed approximation algorithms have been proposed in the literature...
Distributed heuristics for connected dominating sets in wireless ad hoc networks (2002)
Khaled M. Alzoubi, Peng-jun Wan, Ophir Frieder
Abstract: A connected dominating set (CDS) for a graph ¢¡¤£¦¥¨§�© is a subset £� � of £, such that each node in £���£�� is adjacent induces a connected subgraph. CDSs...
Distributed construction of connected dominating set in wireless ad hoc networks (2002)
Peng-jun Wan, Khaled M. Alzoubi, Ophir Frieder
Abstract—Connected dominating set (CDS) has been proposed as virtual backbone or spine of wireless ad hoc networks. Three distributed approximation algorithms have been proposed in the literature...
Sparse power efficient topology for wireless networks (2002)
Xiang-yang Li, Peng-jun Wan, Yu Wang, Ophir Frieder
Abstract dimensional plane. Each wireless node has an omnidirectional antenna. This is attractive for a single trans-We consider how to construct power eficient wireless ad mission of a node can be...
Abdur Chowdhury, Mohammed Aljlayl, Eric Jensen, Steve Beitzel, David Grossman, Ophir Frieder
For TREC 10 we participated in the Named Page Finding Task and the Cross-Lingual Task. In the web track, we explored the use of linear combinations of term collections based on document structure....
G. Calinescu, Ophir Frieder, Senior Member, Peng-jun Wan
Abstract—Automatic ring protection provides simple and rapid fault protection and restoration in telecommunication networks. To implement the automatic ring protection in general wavelengthdivision...
Distributed construction of connected dominating set in wireless ad hoc networks (2002)
Peng-jun Wan, Khaled M. Alzoubi, Ophir Frieder
Abstract—Connected dominating set (CDS) has been proposed as virtual backbone or spine of wireless ad hoc networks. Three distributed approximation algorithms have been proposed in the literature...
Collection Statistics for Fast Duplicate (2002)
Document Detection Abdur, Abdur Chowdhury, Ophir Frieder, David Grossman, Mary Catherine Mccabe
this paper. 3. ALGORITHM Our motivation is to provide a duplicate detection algorithm that can scale to the size of the web and handle the short documents typically seen there. Furthermore, we seek...
Distributed construction of connected dominating set in wireless ad hoc networks (2002)
Khaled M. Alzoubi, Peng-jun Wan, Ophir Frieder
Connected dominating set (CDS) has been proposed as virtual backbone or spine of wireless ad hoc networks. Three distributed approximation algorithms have been proposed in the literature for minimum...
Coverage in Wireless Ad-hoc Sensor Networks (2002)
Xiang-yang Li, Peng-jun Wan, Ophir Frieder
Sensor networks pose a number of challenging conceptual and optimization problems such as location, deployment, and tracking [1]. One of the fundamental problems in sensor networks is the calculation...
Sparse Power Efficient Topology for Wireless Networks (2002)
Xiang-yang Li, Peng-jun Wan, Yu Wang, Ophir Frieder
Due to the nodes' limited ressource in the wireless ad hoc networks, it is important to maintain only a linear number of links...
Effective Arabic-English Cross-Language Information Retrieval via (2001)
Mohammed Aljlayl, Ophir Frieder, David Grossman
A Machine Translation (MT) system is an automatic process that translates from one human language to another language by using context information. We evaluate the use of an MT-based approach for...
On the Integration of Structured Data and Text: A Review of the SIRE Architecture (2000)
Ophir Frieder, Abdur Chowdhury, David Grossman, M. Catherine Mccabe
Over the past decade, members of the Information Retrieval Lab have designed, developed, and deployed a variety of information retrieval systems. A central theme for all of our systems was the...
Wavelength Assignment in WDM Rings to Minimize SONET ADMs (2000)
Liwu Liu, Xiangyang Li, Peng-jun Wan, Ophir Frieder
Abstract — We study wavelength assignment for lightpaths over WDM rings to minimize the SONET ADMs used. This problem has attracted much attention recently. However, its computation complexity...
Grooming of arbitrary traffic in SONET/WDM BLSRs (2000)
Peng-jun Wan, Gruia Călinescu, Liwu Liu, Ophir Frieder
Abstract—SONET add–drop multiplexers (ADMs) are the dominant cost factor in the SONET/WDM rings. They can potentially be reduced by optical bypass via optical add–drop multiplexers (OADMs) and...
Efficiency considerations for scalable information retrieval servers (2000)
Ophir Frieder, Comp Sci, David A. Grossman, Abdur Chowdhury, Gideon Frieder
We overview a variety of techniques to improve efficiency in information retrieval. Given the increasing volumes of data that are available electronically, understanding and using such techniques is...
Wavelength Assignment in WDM Rings to Minimize SONET ADMs (2000)
Liwu Liu, Xiangyang Li, Peng-jun Wan, Ophir Frieder
Abstract — We study wavelength assignment for lightpaths over WDM rings to minimize the SONET ADMs used. This problem has attracted much attention recently. However, its computation complexity...
Efficiency considerations in very large information retrieval servers (2000)
Ophir Frieder, David A. Grossman, Abdur Chowdhury, Comp Sci, Comp Sci
It is estimated that the World Wide Web now contains more than twenty million different content areas, presented on more than 320 million web pages, and one million web servers---and it is doubling...
Network Survivability Simulation of the Commercially Deployed Dynamic Routing System Protocol (2000)
Abdur Chowdhury, Ophir Frieder, Paul Luse, Peng-jun Wan
Abstract. With the ever-increasing demands on server applications, many new server services are distributed in nature. We evaluated one hundred deployed systems and found that over a one-year period,...
Practical Traffic Grooming Scheme for Single-Hub SONET/WDM Rings (2000)
Xiang-yang Li, Liwu Liu, Peng-jun Wan, Ophir Frieder
In SONET/WDM networks, one fiber supports multiple wavelengths and each wavelength supports several low rate tributary streams. "Traffic grooming" then is defined as properly using SONET...
Carol Lundquist, Ophir Frieder, David Grossman, David O. Holmes
Abstract. A scalable, parallel, relational-database driven information retrieval engine is described. To support portability across a wide-range of execution environments, including parallel...
Carol Lundquist, Ophir Frieder, David Grossman, David O. Holmes
Abstract. A scalable, parallel, relational-database driven information retrieval engine is described. To support portability across a wide-range of execution environments, including parallel...
Wavelength Assignment in WDM Rings to Minimize SONET ADMs (1999)
Liwu Liu, Xiangyang Li, Peng-jun Wan, Ophir Frieder
We study wavelength assignment for lightpaths over WDM rings to minimize the SONET ADMs used. This problem has attracted much attention recently. However, its computation complexity remains unknown,...
Carol Lundquist Ophir, Ophir Frieder, David Grossman, David O. Holmes
. A scalable, parallel, relational-database driven information retrieval engine is described. To support portability across a wide-range of execution environments, including parallel multicomputers,...
A Unified Environment for Fusion of Information Retrieval Approaches (1999)
M. Catherine McCabe, Abdur Chowdhury, M. Catherine, Mccabe Abdur Chowdhury, David A. Grossman, Ophir Frieder
Prior work has shown that combining results of various retrieval approaches and query representations can improve search effectiveness. Today, many meta-search engines exist which combine the results...
M. Catherine Mccabe, Abdur Chowdhury, David O. Holmes, David A. Grossman, Kenneth L. Alford, Ophir Frieder
In TREC-8, we participated in the automatic and manual tracks for category A as well as the small web track. This year, we first ensured that our baseline matched the effectiveness achieved by other...
SENTINEL: A Multiple Engine Information Retrieval and Visualization System (1999)
Visualization System, Kevin L. Fox, Margaret M. Knepper, Ophir Frieder, Eric J. Snowberg
We describe a prototype Information Retrieval system, SENTINEL, under development at Harris Corporation's Information Systems Division. SENTINEL is a fusion of multiple information retrieval...
Parallel computation in biological sequence analysis (1998)
Tieng K. Yap, Ophir Frieder, Senior Member, Robert L. Martino
Abstract—A massive volume of biological sequence data is available in over 36 different databases worldwide, including the sequence data generated by the Human Genome project. These databases,...
David O. Holmes, M. Catherine Mccabe, David A. Grossman, Abdur Chowdhury, Ophir Frieder
In TREC-7, we participated in both the automatic and manual tracks for category A. For the automatic runs, we included a baseline run and an experimental run that filtered relevance feedback using...
David O. Holmes, M. Catherine Mccabe, David A. Grossman, Abdur Chowdhury, Ophir Frieder
In TREC-7, we participated in both the automatic and manual tracks for category A. For the automatic runs, we included a baseline run and an experimental run that filtered relevance feedback using...
David O. Holmes, M. Catherine Mccabe, David A. Grossman, Abdur Chowdhury, Ophir Frieder
In TREC-7, we participated in both the automatic and manual tracks for category A. For the automatic runs, we included a baseline run and an experimental run that filtered relevance feedback using...
Parallel computation in biological sequence analysis (1998)
Tieng K. Yap, Ophir Frieder, Senior Member, Robert L. Martino
Abstract—A massive volume of biological sequence data is available in over 36 different databases worldwide, including the sequence data generated by the Human Genome project. These databases,...
Integrating Structured Data and Text: A Relational Approach (1997)
David A. Grossman, Ophir Frieder, David O. Holmes, David C. Roberts
We integrate structured data and text using the unchanged, standard relational model. We started with the premise that a relational system could be used to implement an Information Retrieval (IR)...
Integrating Structured Data and Text: A relational approach (1997)
David A. Grossman, David O. Holmes, Ophir Frieder, David C. Roberts
Weintegrate structured data and text using the unchanged, standard relational model. We started with the premise that a relational system could be used to implement an Information Retrieval #IR#...
Clustering and Classification of Large Document Bases in a Parallel Environment (1997)
Anthony S. Ruocco, Ophir Frieder
: Development of cluster-based search systems has been hampered by prohibitive times involved in clustering large document sets. Once completed, maintaining cluster organizations is difficult in...
Integrating Structured Data and Text: A Relational Approach (1997)
David A. Grossman, Ophir Frieder, David O. Holmes, David C. Roberts
We integrate structured data and text using the unchanged, standard relational model. We started with the premise that a relational system could be used to implement an Information Retrieval (IR)...
DRS A Fault Tolerant Network Routing System For Mission Critical Distributed Applications (1996)
Abdur Chowdhury, David Grossman, Eric Burger, Ophir Frieder
We present a novel proactive routing algorithm that consistently searches for failures via frequent ICMP echo requests. Our algorithm differs from its predecessors in that it is proactive instead of...
Using Relevance Feedback within the Relational Model for TREC-5 (1996)
David Grossman Carol, David A. Grossman, Carol Lundquist, John Reichart, David Holmes, Abdur Chowdhury, ...
For TREC-5, we enhanced our existing prototype that implements relevance ranking using the AT&T DBC-1012 Model 4 parallel database machine to include relevance feedback. We identified SQL to...
Using Relevance Feedback within the Relational Model for TREC-5 (1996)
David A. Grossman, Carol Lundquist, John Reichart, David Holmes, Abdur Chowdhury, Ophir Frieder
For TREC-5, we enhanced our existing prototype that implements relevance ranking using the AT&T DBC-1012 Model 4 parallel database machine to include relevance feedback. We identified SQL to...
A Parallel DBMS Approach to IR in TREC-3 (1995)
David A. Grossman, Office Information Technology, Ophir Frieder
Finally, in a separate set of work, we implemented Damashek's n-gram algorithm for n=3 and were able to show similar results as found when n=5. 1 Introduction For TREC-3, we implemented...
A Parallel DBMS Approach to IR in TREC-3 (1994)
David A. Grossman, Ophir Frieder, David O. Holmes
In this our first year of TREC participation, we implemented an IR system using an AT&T DBC-1012 Model 4 parallel relational database machine. We started with the premise that a relational system...
A Parallel DBMS Approach to IR in TREC-3 (1994)
David A. Grossman, David O. Holmes, Ophir Frieder
In this our first year of TREC participation, we implemented an IR system using an AT&T DBC-1012 Model 4 parallel relational database machine. We started with the premise that a relational system...
Baru: Site and Query Scheduling Policies in Multicomputer Database Systems (1994)
Ophir Frieder, Senior Member, Chaitanya K. Baru, Senior Member
Abstract- We study run-time issues, such as site allocation and query scheduling policies, in executing read-only queries in a hierarchical, distributed memory, multicomputer system. The particular...
On the allocation of documents in multiprocessor information retrieval systems (1991)
Abstract. Information retrieval is the selection of documents that are potentially relevant to a user’s information need. Given the vast volume of data stored in modern information retrieval...
On dynamically updating a computer program: From concept to prototype (1991)
Mark E. Segal, Ophir Frieder, Ophir Frieder, Mark E. Sega
An approach to dynamically updating a computer pro-gram, i.e., updating while it is executing, is presented. Dynamic updating is crucial in applications where the cost of stopping and restarting the...
Database Operations in a Cube-Connected Multicomputer System (1989)
Chaitanya K. Baru, Ophir Frieder
Abstract-Parallel architectures for database processing should pro-vide parallel I/O as well as parallel CPU capability. Two issues that arise in such systems are data combination and nonuniform data...
Protocol verification using database technology (1989)
Abstract-We describe a novel application of database technologies in communications networks: protocol verification on a parallel database machine. We introduce an approach to protocol verification...
DATABASE PROCESSING ON A CUBE-CONNECTED MULTICOMPUTER. (1987)
A boolean cube-connected relational database machine is developed. Strategies for performing the basic relational database operations are presented. The strategies are termed "dynamic" and are novel...
Citation Behavior and Place of Publication in the
Christopher Fox, Anany Levitin, Thomas Redman, Patrick Doreian, Anamarija Bekavac, Jelka Petrak, ...
1 i' dl. *::
Improving Accuracy and Run-Time Performance for TREC-4
David A. Grossman, Ophir Frieder, David O. Holmes, Christopher E. Kingsbury, Matthew D. Nguyen
For TREC-4, we enhanced our existing prototype that implements relevance ranking using the AT&T DBC-1012 Model 4 parallel database machine to support the entire document collection....
DRS: A Fault Tolerant Network Routing System For Mission Critical Distributed Applications
Abdur Chowdhury, David Grossman, Eric Burger, Ophir Frieder
We present a novel proactive routing algorithm that consistently searches for failures via frequent ICMP echo requests. Our algorithm differs from its predecessors in that it is proactive instead of...
Improving Accuracy and Run-Time Performance for TREC-4
David A. Grossman, David O. Holmes, Ophir Frieder, Matthew D. Nguyen, Christopher E. Kingsbury
For TREC-4, we enhanced our existing prototype that implements relevance ranking using the AT&T DBC-1012 Model 4 parallel database machine to support the entire document collection. Additionally,...
Sanjeev Setia, Ophir Frieder, David Grossman
Introduction The data storage and retrieval services utilized by the ECS services layer will be provided by both a Database Management System (DBMS) and a hierarchical file system (see Figure 2.1)....
Improving Accuracy and Run-Time Performance for TREC-4
David Grossman, David O. Holmes, Ophir Frieder, Matthew D. Nguyen, Christopher E. Kingsbury
For TREC-4, we enhanced our existing prototype that implements relevance ranking using the AT&T DBC-1012 Model 4 parallel database machine to support the entire document collection. Additionally,...
Passage relevance models for genomics search
Urbain, Jay, Frieder, Ophir, Goharian, Nazli
We present a passage relevance model for integrating syntactic and semantic evidence of biomedical concepts and topics using a probabilistic graphical model. Component models of topics, concepts,...