Abstract On the Use of Spreading Activation Methods in Automatic Information Retrieval (2008)
Spreading activation methods have been recommended in information retrieval to expand the search vocabulary and to complement the retrieved document sets. The spreading activation strategy is...
Abstract On the Application of Syntactic Methodologies in Automatic Text Analysis (2008)
This study summarizes various linguistic approaches proposed for document analysis in information retrieval environments. Included are standard syntactic methods to generate complex content...
Gerard Salton, Chris Buckley Term, Gerard Salton, Chris Buckley Im, Gerard Salton, Chris Buckley Au, ...
proving retrieval performance by rele- vance feedback. Journal of the Amer- ican Society for Information Science, 41(4):288-297, 1990.
Abstract On the Use of Term Associations in Automatic Information Retrieval (2007)
It has been recognized that single words extracted from natural language texts are not always useful for the representation of information content. Associated or related terms, and complex content...
Automatic Text Decomposition Using Text Segments and Text Themes (1995)
Salton, Gerard, Singhal, Amit, Buckley, Chris, Mitra, Mandar
With the widespread use of full-text information retrieval, passage-retrieval techniques are becoming increasingly popular. Larger texts can then be replaced by important text excerpts, thereby...
Pivoted Document Length Normalization (1995)
Singhal, Amit, Buckley, Chris, Mitra, Mandar, Salton, Gerard
Document length normalization is an important aspect of term weight assignment in an automatic information retrieval system. In this study, we observe that a normalization scheme that retrieves...
Automatic Text Decomposition Using Text Segments and Text Themes (1995)
Salton, Gerard, Singhal, Amit, Buckley, Chris, Mitra, Mandar
With the widespread use of full-text information retrieval, passage-retrieval techniques are becoming increasingly popular. Larger texts can then be replaced by important text excerpts, thereby...
Pivoted Document Length Normalization (1995)
Singhal, Amit, Buckley, Chris, Mitra, Mandar, Salton, Gerard
Document length normalization is an important aspect of term weight assignment in an automatic information retrieval system. In this study, we observe that a normalization scheme that retrieves...
Selective Text Traversal (1995)
In information retrieval and text processing, the size of the available databases and text collections has grown enormously in the past few years, and users find it difficult to cope with the amount...
Selective Text Traversal (1995)
In information retrieval and text processing, the size of the available databases and text collections has grown enormously in the past few years, and users find it difficult to cope with the amount...
Document Length Normalization (1995)
Singhal, Amit, Salton, Gerard, Mitra, Mandar, Buckley, Chris
In the TREC collection -- a large full-text experimental text collection with widely varying document lengths -- we observe that the likelihood of a document being judged relevant by a user increases...
Document Length Normalization (1995)
Singhal, Amit, Salton, Gerard, Mitra, Mandar, Buckley, Chris
In the TREC collection -- a large full-text experimental text collection with widely varying document lengths -- we observe that the likelihood of a document being judged relevant by a user increases...
Length Normalization in Degraded Text Collections (1995)
Singhal, Amit, Salton, Gerard, Buckley, Chris
Optical character recognition (OCR) is the most commonly used technique to convert printed material into electronic form. Using OCR, large repositories of machine readable text can be created in a...
Length Normalization in Degraded Text Collections (1995)
Singhal, Amit, Salton, Gerard, Buckley, Chris
Optical character recognition (OCR) is the most commonly used technique to convert printed material into electronic form. Using OCR, large repositories of machine readable text can be created in a...
Selective Text Utilization and Text Traversal (1995)
Many large collections of full-text documents are currently stored in machine-readable form and processed automatically in various ways. These collections may include different types of documents,...
Automatic Text Browsing Using Vector Space Model (1995)
Vast amounts of text are now available in machinereadable form and can be processed electronically. The vector space model of text processing has been widely used and has consistently produced...
Length Normalization in Degraded Text Collections (1995)
Amit Singhal, Gerard Salton, Chris Buckley
Optical character recognition (OCR) is the most commonly used technique to convert printed material into electronic form. Using OCR, large repositories of machine readable text can be created in a...
Automatic Text Theme Generation and the Analysis of Text Structure (1994)
Non-expository texts are not usually read from cover to cover. Readers are helped in such circumstances by providing selective access to text excerpts as needed. Text themes can be identified...
Automatic Text Theme Generation and the Analysis of Text Structure (1994)
Non-expository texts are not usually read from cover to cover. Readers are helped in such circumstances by providing selective access to text excerpts as needed. Text themes can be identified...
Automatic Text Decomposition and Structuring (1994)
Sophisticated text similarity measurements are used to determine relationships between naturallanguage texts and text segments. The resulting linked hypertext maps are used to identify different text...
The Effect of Adding Relevance Information in a Relevance Feedback Environment (1994)
Chris Buckley, Gerard Salton, James Allan
The effects of adding information from relevant documents are examined in the TREC routing environment. A modified Rocchio relevance feedback approach is used, with a varying number of relevant...
Automatic Routing and Ad-hoc Retrieval Using SMART : TREC 2 (1994)
Chris Buckley, James Allan, Gerard Salton
The Smart information retrieval project emphasizes completely automatic approaches to the understanding and retrieval of large quantities of text. We continue our work in the TREC 2 environment,...
Selective Text Utilization and Text Traversal (1993)
Many large collections of full-text documents are currently stored in machine-readable form and processed automatically in various ways. These collections may include different types of documents,...
Selective Text Utilization and Text Traversal (1993)
Many large collections of full-text documents are currently stored in machine-readable form and processed automatically in various ways. These collections may include different types of documents,...
Approaches to Passage Retrieval in Full Text Information Systems (1993)
Salton, Gerard, Allan, James, Buckley, Chris
Large collections of full-text documents are now commonly used in automated information retrieval. When the stored document texts are long, the retrieval of complete documents may not be in the...
Approaches to Passage Retrieval in Full Text Information Systems (1993)
Salton, Gerard, Allan, James, Buckley, Chris
Large collections of full-text documents are now commonly used in automated information retrieval. When the stored document texts are long, the retrieval of complete documents may not be in the...
Selective text utilization and text traversal (1993)
Many large collections of full-text documents are currently stored in machine-readable form and processed automatically in various ways. These collections may include different types of documents,...
Selective Use of Full-Text Databases (1992)
Salton, Gerard, Allan, James, Buckley, Chris
Large files of natural-language text are now available for automatic processing in machine readable form. Such text files may include documents of textbook size, medium-size newspaper articles and...
Selective Use of Full-Text Databases (1992)
Salton, Gerard, Allan, James, Buckley, Chris
Large files of natural-language text are now available for automatic processing in machine readable form. Such text files may include documents of textbook size, medium-size newspaper articles and...
Automatic Structuring and Retrieval of Large Text Files (1992)
Salton, Gerard, Allan, James, Buckley, Chris
In many operational environments, large text files must be processed covering a wide variety of different topic areas. Aids must then be provided to the user that permit collection browsing and make...
Automatic Structuring and Retrieval of Large Text Files (1992)
Salton, Gerard, Allan, James, Buckley, Chris
In many operational environments, large text files must be processed covering a wide variety of different topic areas. Aids must then be provided to the user that permit collection browsing and make...
Automatic Structuring of Text Files (1991)
Salton, Gerard, Buckley, Chris, Allan, James
In many practical information retrieval situations, it is necessary to process heterogeneous text databases that vary greatly in scope and coverage, and deal with many different subjects. In such an...
Automatic Structuring of Text Files (1991)
Salton, Gerard, Buckley, Chris, Allan, James
In many practical information retrieval situations, it is necessary to process heterogeneous text databases that vary greatly in scope and coverage, and deal with many different subjects. In such an...
The State of Retrieval System Evaluation (1991)
Substantial misgivings have been voiced over the years about the methodologies used to evaluate information retrieval procedures, and about the credibility of many of the available test results. In...
The State of Retrieval System Evaluation (1991)
Substantial misgivings have been voiced over the years about the methodologies used to evaluate information retrieval procedures, and about the credibility of many of the available test results. In...
Automatic Text Structuring and Retrieval- Experiments in Automatic Encyclopedia Searching (1991)
Salton, Gerard, Buckley, Chris
Many conventional approaches to text analysis and information retrieval prove ineffective when large text collections must be processed in heterogeneous subject areas. An alternative text...
Automatic Text Structuring and Retrieval- Experiments in Automatic Encyclopedia Searching (1991)
Salton, Gerard, Buckley, Chris
Many conventional approaches to text analysis and information retrieval prove ineffective when large text collections must be processed in heterogeneous subject areas. An alternative text...
A Note on Term Weighting and Text Matching (1990)
Salton, Gerard, Buckley, Chris
In information retrieval, it is not uncommon to be faced with large collections of unrestricted natural-language text. In such circumstances, the text analysis and retrieval operations must be based...
A Note on Term Weighting and Text Matching (1990)
Salton, Gerard, Buckley, Chris
In information retrieval, it is not uncommon to be faced with large collections of unrestricted natural-language text. In such circumstances, the text analysis and retrieval operations must be based...
Flexible Text Matching for Information Retrieval (1990)
Salton, Gerard, Buckley, Chris
Very large text databases now exist in machine-readable form, covering arbitrary subject matter in unrestricted discourse areas. The conventional text retrieval approaches are not easily used in such...
Flexible Text Matching for Information Retrieval (1990)
Salton, Gerard, Buckley, Chris
Very large text databases now exist in machine-readable form, covering arbitrary subject matter in unrestricted discourse areas. The conventional text retrieval approaches are not easily used in such...
A Simple Syntactic Approach for the Generation of Indexing Phrases (1990)
Salton, Gerard, Zhao, Zhongnan, Buckley, Chris
A syntactic approach is described for generating indexing phrases usable for the content identification of natural-language texts. The phrase generation method is based on a simple language analysis...
A Simple Syntactic Approach for the Generation of Indexing Phrases (1990)
Salton, Gerard, Zhao, Zhongnan, Buckley, Chris
A syntactic approach is described for generating indexing phrases usable for the content identification of natural-language texts. The phrase generation method is based on a simple language analysis...
An Evaluation of Text Matching Systems for Text Excerpts of Varying Scope (1990)
Salton, Gerard, Buckley, Chris
When large text collections must be processed, it is not possible to limit the scope of the subject matter of interest. In such a situation the standard content analysis methods that are based on the...
An Evaluation of Text Matching Systems for Text Excerpts of Varying Scope (1990)
Salton, Gerard, Buckley, Chris
When large text collections must be processed, it is not possible to limit the scope of the subject matter of interest. In such a situation the standard content analysis methods that are based on the...
Text Linking and Retrieval Experiments for Textbook Components (1990)
Salton, Gerard, Buckley, Chris, Zhao, Zhongnan
Experiments are described designed to retrieve individual paragraphs of textbook material in answer to user-submitted queries. The retrieval strategies are based on the global comparison of paragraph...
Text Linking and Retrieval Experiments for Textbook Components (1990)
Salton, Gerard, Buckley, Chris, Zhao, Zhongnan
Experiments are described designed to retrieve individual paragraphs of textbook material in answer to user-submitted queries. The retrieval strategies are based on the global comparison of paragraph...
Approaches to Global Text Analysis (1990)
Salton, Gerard, Buckley, Chris
The current approaches to the analysis of natural language text are not viable for documents of unrestricted scope. A global text analysis system is proposed designed to identify homogeneous text...
Approaches to Global Text Analysis (1990)
Salton, Gerard, Buckley, Chris
The current approaches to the analysis of natural language text are not viable for documents of unrestricted scope. A global text analysis system is proposed designed to identify homogeneous text...
Approaches to Text Retreival for Structured Documents (1990)
Salton, Gerard, Buckley, Chris
Documents such as textbooks, dictionaries, and encyclopedias are inherently structured, in the sense that they are meant to be used selectively by skipping from section to section instead of reading...
Approaches to Text Retreival for Structured Documents (1990)
Salton, Gerard, Buckley, Chris
Documents such as textbooks, dictionaries, and encyclopedias are inherently structured, in the sense that they are meant to be used selectively by skipping from section to section instead of reading...
Improving retrieval performance by relevance feedback (1990)
Relevance feedback is an automatic process, introduced over 20 years ago, designed to produce improved query formulations following an initial retrieval operation. The principal relevance feedback...
A Comparison of Book Indexing Methods (1989)
Several book indexing methods are introduced based in part on statistical and in part on syntactic analysis methods to process the text of documents. These indexing methods are used to construct...
A Comparison of Book Indexing Methods (1989)
Several book indexing methods are introduced based in part on statistical and in part on syntactic analysis methods to process the text of documents. These indexing methods are used to construct...
A Comparison Between Statistically and Sytactically Generated Term Phrases (1989)
Salton, Gerard, Buckley, Chris
It is customary to use single terms (words) or terms in context (phrases) as indexing units for the representation of natural-language text content. There is evidence that term phrases may provide...
A Comparison Between Statistically and Sytactically Generated Term Phrases (1989)
Salton, Gerard, Buckley, Chris
It is customary to use single terms (words) or terms in context (phrases) as indexing units for the representation of natural-language text content. There is evidence that term phrases may provide...
User-System Interaction in Automatic Information Retrieval (1989)
Salton, Gerard, Crouch, Donald B.
Information retrieval activities are now routinely conducted on-line under the control of search intermediaries or end users. Examples are presented of advanced aids that permit the user to control...
User-System Interaction in Automatic Information Retrieval (1989)
Salton, Gerard, Crouch, Donald B.
Information retrieval activities are now routinely conducted on-line under the control of search intermediaries or end users. Examples are presented of advanced aids that permit the user to control...
On the Use of Clustered File Organization in Information Search and Retrieval (1989)
Salton, Gerard, Araya, Jose E.
In modern retrieval environments, collection searches are normally conducted on-line under user control. Iterative collection searches can then be performed where tentative queries are initially...
Formalization and Evaluation of Linear Relevance Feedback (1989)
Wong, S. K. M., Yao, Y. Y., Salton, Gerard, Buckley, Chris
This study outlines an adaptive method which constructs improved query vectors based on the user preference judgments on sample document pairs. In particular, the user states that some documents are...
On the Automatic Generation of Content Links in Hypertext (1989)
Salton, Gerard, Buckley, Chris
Text structuring systems that provide links between text portions have been widely proposed as aids for text preparation and text manipulation. In principle, it is easy to follow available links...
On the Use of Clustered File Organization in Information Search and Retrieval (1989)
Salton, Gerard, Araya, Jose E.
In modern retrieval environments, collection searches are normally conducted on-line under user control. Iterative collection searches can then be performed where tentative queries are initially...
Formalization and Evaluation of Linear Relevance Feedback (1989)
Wong, S. K. M., Yao, Y. Y., Salton, Gerard, Buckley, Chris
This study outlines an adaptive method which constructs improved query vectors based on the user preference judgments on sample document pairs. In particular, the user states that some documents are...
On the Automatic Generation of Content Links in Hypertext (1989)
Salton, Gerard, Buckley, Chris
Text structuring systems that provide links between text portions have been widely proposed as aids for text preparation and text manipulation. In principle, it is easy to follow available links...
A Syntactic Approach to Automatic Book Indexing (1989)
Automatic publishing systems are now widely used to produce books and documents of many types. As a result, large masses of text become available in machine readable form for automatic processing....
A Syntactic Approach to Automatic Book Indexing (1989)
Automatic publishing systems are now widely used to produce books and documents of many types. As a result, large masses of text become available in machine readable form for automatic processing....
On the Application of Syntactic Methodologies in Automatic Text Analysis (1988)
This study summarizes various linguistic approaches proposed for document analysis in information retrieval environments. Included are standard syntactic methods to generate complex content...
On the Application of Syntactic Methodologies in Automatic Text Analysis (1988)
This study summarizes various linguistic approaches proposed for document analysis in information retrieval environments. Included are standard syntactic methods to generate complex content...
On the Use of Spreading Activation Methods in Automatic Information Retrieval (1988)
Salton, Gerard, Buckley, Chris
Spreading activation methods have been recommended in information retrieval to expand the search vocabulary and to complement the retrieved document sets. The spreading activation strategy is...
On the Use of Spreading Activation Methods in Automatic Information Retrieval (1988)
Salton, Gerard, Buckley, Chris
Spreading activation methods have been recommended in information retrieval to expand the search vocabulary and to complement the retrieved document sets. The spreading activation strategy is...
Improving Retrieval Performance by Relevance Feedback (1988)
Salton, Gerard, Buckley, Chris
Relevance feedback is an automatic process, introduced over 20 years ago, designed to produce query formulations following an initial retrieval operation. The principal relevance feedback methods...
Improving Retrieval Performance by Relevance Feedback (1988)
Salton, Gerard, Buckley, Chris
Relevance feedback is an automatic process, introduced over 20 years ago, designed to produce query formulations following an initial retrieval operation. The principal relevance feedback methods...
Term-weighting approaches in automatic text retrieval (1988)
Gerard Salton, Christopher Buckley
Abstract-The experimental evidence accumulated over the past 20 years indicates that text indexing systems based on the assignment of appropriately weighted single terms produce retrieval results...
Parallel Text Search Methods (1988)
A comparison of recently proposed parallel text search methods to alternative available search strategies that use serial processing machines suggests parallel methods do not provide large-scale...
Term Weighting Approaches in Automatic Text Retrieval (1987)
Salton, Gerard, Buckley, Chris
The experimental evidence accumulated over the past 20 years indicates that textindexing systems based on the assignment of appropriately weighted single terms produce retrieval results that are...
Term Weighting Approaches in Automatic Text Retrieval (1987)
Salton, Gerard, Buckley, Chris
The experimental evidence accumulated over the past 20 years indicates that textindexing systems based on the assignment of appropriately weighted single terms produce retrieval results that are...
Historical Note: The Past Thirty Years in Information Retrieval (1987)
The documentation literature of the nineteen fifties is reviewed briefly, and some early text processing endeavors are discussed. Various predictions made in 1960 by Mooers about the creative role of...
Parallel Text Search Methods (1987)
Salton, Gerard, Buckley, Chris
An evaluation of recently proposed parallel text search methods does not support the notion that the parallel methods provide large-scale gains in either retrieval effectiveness or efficiency,...
Historical Note: The Past Thirty Years in Information Retrieval (1987)
The documentation literature of the nineteen fifties is reviewed briefly, and some early text processing endeavors are discussed. Various predictions made in 1960 by Mooers about the creative role of...
Parallel Text Search Methods (1987)
Salton, Gerard, Buckley, Chris
An evaluation of recently proposed parallel text search methods does not support the notion that the parallel methods provide large-scale gains in either retrieval effectiveness or efficiency,...
Enhancement of Text Representations Using Related Document Titles (1986)
Various attempts have been made over the years to construct enhanced document representations by using thesauruses of related terms, term association maps, or knowledge frameworks that can be used to...
Enhancement of Text Representations Using Related Document Titles (1986)
Various attempts have been made over the years to construct enhanced document representations by using thesauruses of related terms, term association maps, or knowledge frameworks that can be used to...
Another Look at Automatic Text Retrieval Systems (1985)
The characteristics of automatic text retrieval systems are briefly described, and the available experimental evidence comparing manual with automatic retrieval is reviewed. Several automatic text...
Another Look at Automatic Text Retrieval Systems (1985)
The characteristics of automatic text retrieval systems are briefly described, and the available experimental evidence comparing manual with automatic retrieval is reviewed. Several automatic text...
A Note About Information Science Research (1984)
This note deals with the relationship between information science research and practice. The impression that the field is moribund and that the research output is uniformly inferior is not supported...
A Note About Information Science Research (1984)
This note deals with the relationship between information science research and practice. The impression that the field is moribund and that the research output is uniformly inferior is not supported...
Automatic Assignment of Soft Boolean Operators (1984)
Salton, Gerard, Voorhees, Ellen M.
The conventional bibliographic retrieval systems are based on Boolean query formulations and inverted file implementations. Such systems provide rapid responses in answer to search queries but they...
Some Notions about Information Retrieval in Automated Office Environments (1984)
A substantial portion of the work required in office environments involves the processing of natural language texts. This note contains some ideas relating to the structure of natural language texts...
Automatic Assignment of Soft Boolean Operators (1984)
Salton, Gerard, Voorhees, Ellen M.
The conventional bibliographic retrieval systems are based on Boolean query formulations and inverted file implementations. Such systems provide rapid responses in answer to search queries but they...
Some Notions about Information Retrieval in Automated Office Environments (1984)
A substantial portion of the work required in office environments involves the processing of natural language texts. This note contains some ideas relating to the structure of natural language texts...
The Use of Extended Boolean Logic in Information Retrieval (1984)
An extended Boolean retrieval strategy has previously been introduced in which the individual Boolean operators can be treated more or less strictly, depending on the perceived strength of...
The Use of Extended Boolean Logic in Information Retrieval (1984)
An extended Boolean retrieval strategy has previously been introduced in which the individual Boolean operators can be treated more or less strictly, depending on the perceived strength of...
Howard Aiken's Children: The Harvard Computation Laboratory and its Students (1983)
NO ABSTRACT SUPPLIED
Howard Aiken's Children: The Harvard Computation Laboratory and its Students (1983)
NO ABSTRACT SUPPLIED
Advanced Feedback Methods in Information Retrieval (1983)
Salton, Gerard, Fox, Edward A., Voorhees, Ellen M.
Automatic feedback methods may be used in on-line information retrieval to generate improved query statements based on information contained in previously retrieved documents. In this study automatic...
Advanced Feedback Methods in Information Retrieval (1983)
Salton, Gerard, Fox, Edward A., Voorhees, Ellen M.
Automatic feedback methods may be used in on-line information retrieval to generate improved query statements based on information contained in previously retrieved documents. In this study automatic...
A Comparison of Two Methods for Boolean Query Relevance Feedback (1983)
Salton, Gerard, Fox, Edward A., Voorhees, Ellen M.
The relevance feedback process uses information derived from an initially retrieved set of documents to improve subsequent search formulations and retrieval output. In a Boolean query environment...
A Comparison of Two Methods for Boolean Query Relevance Feedback (1983)
Salton, Gerard, Fox, Edward A., Voorhees, Ellen M.
The relevance feedback process uses information derived from an initially retrieved set of documents to improve subsequent search formulations and retrieval output. In a Boolean query environment...
A Generalized Term Dependence Model in Information Retrieval (1983)
Yu, C. T., Buckley, Chris, Lam, K., Salton, Gerard
The tree dependence model has been used successfully to incorporate dependencies between certain term pairs on the information retrieval process, while the Bahadur Lazarsfeld Expansion (BLE) which...
A Generalized Term Dependence Model in Information Retrieval (1983)
Yu, C. T., Buckley, Chris, Lam, K., Salton, Gerard
The tree dependence model has been used successfully to incorporate dependencies between certain term pairs on the information retrieval process, while the Bahadur Lazarsfeld Expansion (BLE) which...
Boolean Query Formulation with Relevance Feedback (1983)
Salton, Gerard, Fox, Edward A., Buckley, Chris, Voorhees, Ellen M.
The well-known relevance feedback process uses information extracted from previously retrieved relevant documents to generate improved search formulations for subsequent search iterations. Methods...
Boolean Query Formulation with Relevance Feedback (1983)
Salton, Gerard, Fox, Edward A., Buckley, Chris, Voorhees, Ellen M.
The well-known relevance feedback process uses information extracted from previously retrieved relevant documents to generate improved search formulations for subsequent search iterations. Methods...
Automatic Query Formulations in Information Retrieval (1982)
Salton, Gerard, Buckley, Chris, Fox, Edward A.
Modern information retrieval systems are designed to supply relevant information in response to requests received from the user population. In most retrieval environments the search requests consist...
Automatic Query Formulations in Information Retrieval (1982)
Salton, Gerard, Buckley, Chris, Fox, Edward A.
Modern information retrieval systems are designed to supply relevant information in response to requests received from the user population. In most retrieval environments the search requests consist...
Extended Boolean Information Retrieval (1982)
Salton, Gerard, Fox, Edward A., Wu, Harry
In conventional information retrieval Boolean combinations of index terms are used to formulate the users' information requests. While any document is in principle retrievable by a Boolean query, the...
Extended Boolean Information Retrieval (1982)
Salton, Gerard, Fox, Edward A., Wu, Harry
In conventional information retrieval Boolean combinations of index terms are used to formulate the users' information requests. While any document is in principle retrievable by a Boolean query, the...
A Comparison of Search Term Weighting: Term Relevance vs. Inverse Document Frequency (1981)
The term relevance weighting method has been shown to produce optimal information retrieval queries under well-defined conditions. The parameters needed to generate the term relevance factors cannot...
A Comparison of Search Term Weighting: Term Relevance vs. Inverse Document Frequency (1981)
The term relevance weighting method has been shown to produce optimal information retrieval queries under well-defined conditions. The parameters needed to generate the term relevance factors cannot...
Parallel Computations in Information Retrieval (1980)
Conventional information retrieval processes are largely based on data movement, pointer manipulations and integer arithmetic; more refined retrieval algorithms may in addition benefit from...
Parallel Computations in Information Retrieval (1980)
Conventional information retrieval processes are largely based on data movement, pointer manipulations and integer arithmetic; more refined retrieval algorithms may in addition benefit from...
A Progress Report on Automatic Information Retrieval (1979)
This study is a state-of-the-art report of work in automatic information retrieval. Various enhancements of operational on-line retrieval systems are described such as the utilization of special...
A Progress Report on Automatic Information Retrieval (1979)
This study is a state-of-the-art report of work in automatic information retrieval. Various enhancements of operational on-line retrieval systems are described such as the utilization of special...
A standard approach is introduced for the representation of information content in data base and document retrieval environments. The use of composite concept vectors representing individual...
A Citation Study of the Computer Science Literature (1979)
The bibliographic references and citations which exist between documents in a given collection environment can be used to study the history and scope of particular subject areas and to assess the...
A standard approach is introduced for the representation of information content in data base and document retrieval environments. The use of composite concept vectors representing individual...
A Citation Study of the Computer Science Literature (1979)
The bibliographic references and citations which exist between documents in a given collection environment can be used to study the history and scope of particular subject areas and to assess the...
Mathematics and Information Retrieval (1978)
The development of a given discipline in science and technology often depends on the availability of theories capable of describing the processes which control the field and of modelling the...
Mathematics and Information Retrieval (1978)
The development of a given discipline in science and technology often depends on the availability of theories capable of describing the processes which control the field and of modelling the...
Term Relevance Weights in On-Line Information Retrieval (1977)
Salton, Gerard, Waldstein, R. K.
Considerable evidence exists to show that the use of term relevance weights is beneficial in interactive information retrieval. Various term weighting systems are reviewed. An experiment is then...
Term Relevance Weights in On-Line Information Retrieval (1977)
Salton, Gerard, Waldstein, R. K.
Considerable evidence exists to show that the use of term relevance weights is beneficial in interactive information retrieval. Various term weighting systems are reviewed. An experiment is then...
A Computer Science View Obtained by Automatic Document Processing (1977)
Automatic document classification techniques have been widely advocated for the study of various fields of learning, the identification of individual research topics and of influential contributors...
A Computer Science View Obtained by Automatic Document Processing (1977)
Automatic document classification techniques have been widely advocated for the study of various fields of learning, the identification of individual research topics and of influential contributors...
Generation and Search of Clustered Files (1977)
Bergmark, D., Salton, Gerard, Wong, A.
A classified, or clustered file is one where related, or similar records are grouped into classes, or clusters of items in such a way that all items within a cluster are jointly retrievable....
Generation and Search of Clustered Files (1977)
Bergmark, D., Salton, Gerard, Wong, A.
A classified, or clustered file is one where related, or similar records are grouped into classes, or clusters of items in such a way that all items within a cluster are jointly retrievable....
Clustered File Generation and Its Application to Computer Science Taxonomies (1976)
A clustered file organization is one where related, or similar records are grouped into classes, or clusters of items in such a way that all items within a cluster are jointly retrievable. Such a...
Clustered File Generation and Its Application to Computer Science Taxonomies (1976)
A clustered file organization is one where related, or similar records are grouped into classes, or clusters of items in such a way that all items within a cluster are jointly retrievable. Such a...
Effective Automatic Indexing Using Single Terms, Term Phrases and Thesaurus Class Assignments (1976)
Yu, C. T., Salton, Gerard, Siu, M. K.
In a retrieval environment, indexing is the task which consists in the assignment to stored records and incoming information requests of content identifiers capable of representing record or query...
Effective Automatic Indexing Using Single Terms, Term Phrases and Thesaurus Class Assignments (1976)
Yu, C. T., Salton, Gerard, Siu, M. K.
In a retrieval environment, indexing is the task which consists in the assignment to stored records and incoming information requests of content identifiers capable of representing record or query...
Cataloging Software Packages for Automatic Document Processing (1976)
Considerable advances have been made in the last few years in the development of hardware structures useful for the implementation of real-time document processing systems and on-line information...
Cataloging Software Packages for Automatic Document Processing (1976)
Considerable advances have been made in the last few years in the development of hardware structures useful for the implementation of real-time document processing systems and on-line information...
A Comparison of Term Value Measurements for Automatic Indexing (1975)
A number of automatic theories have been proposed over the last few years leading to the assignment of significance values to linguistic entities in accordance with their importance for purposes of...
The Effectiveness of the Thesaurus Method in Automatic Information Retrieval (1975)
Term grouping and thesaurus methods have frequently been incorporated into automatic content analysis programs as devices for the recognition of synonymous expressions and of linguistic entities that...
A Comparison of Term Value Measurements for Automatic Indexing (1975)
A number of automatic theories have been proposed over the last few years leading to the assignment of significance values to linguistic entities in accordance with their importance for purposes of...
The Effectiveness of the Thesaurus Method in Automatic Information Retrieval (1975)
Term grouping and thesaurus methods have frequently been incorporated into automatic content analysis programs as devices for the recognition of synonymous expressions and of linguistic entities that...
Effective Information Retrieval Using Term Accuracy (1975)
The performance of information retrieval systems can be evaluated in a number of different ways. Much of the published evaluation work is based on measuring the retrieval performance of an average...
Automatic Indexing Using Term Discrimination and Term Precision Measurements (1975)
Salton, Gerard, Wong, A., Yu, C. T.
A variety of abstract automatic indexing models have been developed in recent times in an effort to produce indexing methods that are both effective and usable in practice. Among these are thee term...
Effective Information Retrieval Using Term Accuracy (1975)
The performance of information retrieval systems can be evaluated in a number of different ways. Much of the published evaluation work is based on measuring the retrieval performance of an average...
Automatic Indexing Using Term Discrimination and Term Precision Measurements (1975)
Salton, Gerard, Wong, A., Yu, C. T.
A variety of abstract automatic indexing models have been developed in recent times in an effort to produce indexing methods that are both effective and usable in practice. Among these are thee term...
On the Role of Words and Phrases in Automatic Text Analysis (1975)
One of the most crucial operations in automatic information retrieval is the assignment to written texts and documents of appropriate identifiers, capable of representing information content for...
On the Role of Words and Phrases in Automatic Text Analysis (1975)
One of the most crucial operations in automatic information retrieval is the assignment to written texts and documents of appropriate identifiers, capable of representing information content for...
The Information Revolution - Myth or Reality (1975)
The early work in the design of automatic information systems, exemplified by the contributions of H. P. Luhn and others, now goes back nearly twenty years. It may be useful to ask whether a great...
The Information Revolution - Myth or Reality (1975)
The early work in the design of automatic information systems, exemplified by the contributions of H. P. Luhn and others, now goes back nearly twenty years. It may be useful to ask whether a great...
Precision Weighting - An Effective Automatic Indexing Method (1975)
A great many automatic indexing methods have been implemented and evaluated over the last few years, and automatic procedures comparable in effectiveness to conventional manual ones are now easy to...
Precision Weighting - An Effective Automatic Indexing Method (1975)
A great many automatic indexing methods have been implemented and evaluated over the last few years, and automatic procedures comparable in effectiveness to conventional manual ones are now easy to...
Dynamic information and library processing / Gerard Salton (1975)
Incluye bibliografía
A Theory of Term Importance in Automatic Text Analysis (1974)
Salton, Gerard, Yang, C. S., Yu, C. T.
Most existing automatic content analysis and indexing techniques are based on word frequency characteristics applied largely in an ad hoc manner. Contradictory requirements arise in this connection,...
A Vector Space Model for Automatic Indexing (1974)
Salton, Gerard, Wong, A., Yang, C. S.
In a document retieval, or other pattern matching environment where stored entities (documents) are compared with each other, or with incoming patterns (search requests), it appears that the best...
A Theory of Term Importance in Automatic Text Analysis (1974)
Salton, Gerard, Yang, C. S., Yu, C. T.
Most existing automatic content analysis and indexing techniques are based on word frequency characteristics applied largely in an ad hoc manner. Contradictory requirements arise in this connection,...
A Vector Space Model for Automatic Indexing (1974)
Salton, Gerard, Wong, A., Yang, C. S.
In a document retieval, or other pattern matching environment where stored entities (documents) are compared with each other, or with incoming patterns (search requests), it appears that the best...
THe content analysis, or indexing problem, is fundamental in information storage and retrieval. Several automatic procedures are examined for the assignment of significance values to the terms, or...
THe content analysis, or indexing problem, is fundamental in information storage and retrieval. Several automatic procedures are examined for the assignment of significance values to the terms, or...
Contribution to the Theory of Indexing (1973)
Salton, Gerard, Yang, C. S., Yu, C. T.
An attempt is made to characterize the usefulness of terms occurring in stored documents and user queries as a function of their frequency characteristics across the documents of a collection. It is...
Contribution to the Theory of Indexing (1973)
Salton, Gerard, Yang, C. S., Yu, C. T.
An attempt is made to characterize the usefulness of terms occurring in stored documents and user queries as a function of their frequency characteristics across the documents of a collection. It is...
On the Specification of Term Values in Automatic Indexing (1973)
The existing practice in automatic indexing is reviewed, and it is shown that the standard theories for the specification of term values (or weights) are not adequate. New techniques are introduced...
On the Specification of Term Values in Automatic Indexing (1973)
The existing practice in automatic indexing is reviewed, and it is shown that the standard theories for the specification of term values (or weights) are not adequate. New techniques are introduced...
Engineering: Cornell Quarterly, Vol.08, No.3 (Autumn 1973): The Science of Information (1973)
Conway, Richard, Salton, Gerard, Pottle, Christopher, Lewis, David
IN THIS ISSUE: The Nature of Information: The Province of Computer Science in the University Today /2 (Not machines but knowledge about information itself is the main concern of computer science...
Engineering: Cornell Quarterly, Vol.08, No.3 (Autumn 1973): The Science of Information (1973)
Conway, Richard, Salton, Gerard, Pottle, Christopher, Lewis, David
IN THIS ISSUE: The Nature of Information: The Province of Computer Science in the University Today /2 (Not machines but knowledge about information itself is the main concern of computer science...
Experiments in Multi-Lingual Information Retrieval (1972)
A comparison was made of the performance in an automatic information retrieval environment of user queries and document abstracts available in natural language form in both English and French. The...
Experiments in Multi-Lingual Information Retrieval (1972)
A comparison was made of the performance in an automatic information retrieval environment of user queries and document abstracts available in natural language form in both English and French. The...
Automatic Processing of Current Affairs Queries (1972)
The SMART system is used for the analysis, search, and retrieval of news stories appearing in Time magazine. A comparison is made between the automatic text processing methods incorporated into the...
Proposals for a Dynamic Library (1972)
The current library environment is first examined, and an attempt is made to explain why the standard approaches to the library problem have been less productive than had been anticipated. A new...
Automatic Processing of Current Affairs Queries (1972)
The SMART system is used for the analysis, search, and retrieval of news stories appearing in Time magazine. A comparison is made between the automatic text processing methods incorporated into the...
Proposals for a Dynamic Library (1972)
The current library environment is first examined, and an attempt is made to explain why the standard approaches to the library problem have been less productive than had been anticipated. A new...
Dynamic Document Processing (1972)
The current role of computers in automatic document processing is briefly outlined, and some reasons are given why the early promise of library automation and of the mechanization of documentation...
Dynamic Document Processing (1972)
The current role of computers in automatic document processing is briefly outlined, and some reasons are given why the early promise of library automation and of the mechanization of documentation...
A New Comparison Between Conventional Indexing (MEDLARS) andAutomatic Text Processing (SMART) (1971)
A new testing process is described designed to compare conventional retrieval (MEDLARS) and automatic text analysis methods (SMART). The results obtained with a collection of documents chosen...
A New Comparison Between Conventional Indexing (MEDLARS) andAutomatic Text Processing (SMART) (1971)
A new testing process is described designed to compare conventional retrieval (MEDLARS) and automatic text analysis methods (SMART). The results obtained with a collection of documents chosen...
The "Generality" Effect and the Retrieval Evaluation forLarge Collections (1970)
The retrieval effectiveness of large document collections is normally assessed by using small subsections of the file for test purposes, and extrapolating the data upward to represent the results for...
The "Generality" Effect and the Retrieval Evaluation forLarge Collections (1970)
The retrieval effectiveness of large document collections is normally assessed by using small subsections of the file for test purposes, and extrapolating the data upward to represent the results for...
Evaluation Problems in Interactive Information Retrieval (1969)
Interactive retrieval procedures are normally based on rapidly accessible files. Special storage organizations and file search techniques are used, and the system user is made to fulfill an important...
Interactive Information Retrieval (1969)
The advent of time-sharing computer organizations and input-output console equipment has made it possible to experiment with interactive information handling methods in which the user takes on...
Evaluation Problems in Interactive Information Retrieval (1969)
Interactive retrieval procedures are normally based on rapidly accessible files. Special storage organizations and file search techniques are used, and the system user is made to fulfill an important...
Interactive Information Retrieval (1969)
The advent of time-sharing computer organizations and input-output console equipment has made it possible to experiment with interactive information handling methods in which the user takes on...
Automatic Text Analysis (1969)
Effective automatic methods are now available to replace conventional document indexing and classification.
Automatic Text Analysis (1969)
Effective automatic methods are now available to replace conventional document indexing and classification.
Dynamic File Organization and Heuristic SearchStrategies in Information Retrieval (1969)
A great deal of effort has been devoted in recent years to the evaluation of automatic or semi-automatic information retrieval systems. Recent evaluation results indicate that the search...
Dynamic File Organization and Heuristic SearchStrategies in Information Retrieval (1969)
A great deal of effort has been devoted in recent years to the evaluation of automatic or semi-automatic information retrieval systems. Recent evaluation results indicate that the search...
Relevance Assessments and Retrieval System Evaluation (1968)
Two widely used criteria for evaluating the effectiveness of information retrieval systems are, respectively, the recall and the precision. Since the determination of these measures is dependent on a...
Relevance Assessments and Retrieval System Evaluation (1968)
Two widely used criteria for evaluating the effectiveness of information retrieval systems are, respectively, the recall and the precision. Since the determination of these measures is dependent on a...
A Comparison Between Manual and Automatic Indexing Methods (1968)
The effectiveness of conventional document indexing is compared with that achievable by fully-automatic text processing methods. Evaluation results are given for a comparison between the MEDLARS...
The Use of Standardized Documentary Data in AutomaticInformation Dissemination (1968)
It is likely that future operating information retrieval systems may be based on automatic information analysis methods instead of manual indexing, and on search procedures which allow the user to...
A Comparison Between Manual and Automatic Indexing Methods (1968)
The effectiveness of conventional document indexing is compared with that achievable by fully-automatic text processing methods. Evaluation results are given for a comparison between the MEDLARS...
The Use of Standardized Documentary Data in AutomaticInformation Dissemination (1968)
It is likely that future operating information retrieval systems may be based on automatic information analysis methods instead of manual indexing, and on search procedures which allow the user to...
Automated Language Processing (1968)
This study covers recent developments in automatic text processing, including syntactic, semantic, and statistical language analysis methods. The emphasis is on applications in the areas of machine...
Search and Retrieval Experiments in Real-Time Information Retrieval (1968)
Future operating document retrieval systems may be based on fully-automatic information analysis methods instead of manual indexing, and on real-time search procedures which allow the user to...
Automated Language Processing (1968)
This study covers recent developments in automatic text processing, including syntactic, semantic, and statistical language analysis methods. The emphasis is on applications in the areas of machine...
Search and Retrieval Experiments in Real-Time Information Retrieval (1968)
Future operating document retrieval systems may be based on fully-automatic information analysis methods instead of manual indexing, and on real-time search procedures which allow the user to...
Automatic Content Analysis in Information Retrieval (1968)
The content analysis problem is first introduced, and some of the standard analysis procedures used in information retrieval are reviewed. The principal content analysis methods incorporated into the...
Automatic Content Analysis in Information Retrieval (1968)
The content analysis problem is first introduced, and some of the standard analysis procedures used in information retrieval are reviewed. The principal content analysis methods incorporated into the...
Automatic information organization and retrieval / Gerard Salton (1968)
Incluye bibliografía e índice
An automatic data processing system for public utility revenue accounting [microform]. (1958)
Thesis (Ph. D.)--Harvard University, 1958.
Automatic Text Decomposition Using Text Segments and Text Themes
Gerard Salton, Amit Singhal, Chris Buckley, Ar Mitra
With the widespread use of full-text information retrieval, passage-retrieval techniques are becoming increasingly popular. Larger texts can then be replaced by important text excerpts, thereby...
Automatic Text Decomposition Using Text Segments and Text Themes
Gerard Salton, Amit Singhal, Chris Buckley, Mandar Mitra, Ar Mitra
With the widespread use of full-text information retrieval, passage-retrieval techniques are becoming increasingly popular. Larger texts can then be replaced by important text excerpts, thereby...