Now showing items 42-61 of 87

    • Flexible Text Matching for Information Retrieval 

      Salton, Gerard; Buckley, Chris (Cornell University, 1990-09)
      Very large text databases now exist in machine-readable form, covering arbitrary subject matter in unrestricted discourse areas. The conventional text retrieval approaches are not easily used in such circumstances, because ...
    • Formalization and Evaluation of Linear Relevance Feedback 

      Wong, S. K. M.; Yao, Y. Y.; Salton, Gerard; Buckley, Chris (Cornell University, 1989-04)
      This study outlines an adaptive method which constructs improved query vectors based on the user preference judgments on sample document pairs. In particular, the user states that some documents are preferred to other ...
    • The "Generality" Effect and the Retrieval Evaluation forLarge Collections 

      Salton, Gerard (Cornell University, 1970-07)
      The retrieval effectiveness of large document collections is normally assessed by using small subsections of the file for test purposes, and extrapolating the data upward to represent the results for the full collection. ...
    • A Generalized Term Dependence Model in Information Retrieval 

      Yu, C. T.; Buckley, Chris; Lam, K.; Salton, Gerard (Cornell University, 1983-02)
      The tree dependence model has been used successfully to incorporate dependencies between certain term pairs on the information retrieval process, while the Bahadur Lazarsfeld Expansion (BLE) which specifies dependencies ...
    • Generation and Search of Clustered Files 

      Bergmark, D.; Salton, Gerard; Wong, A. (Cornell University, 1977-01)
      A classified, or clustered file is one where related, or similar records are grouped into classes, or clusters of items in such a way that all items within a cluster are jointly retrievable. Clustered files exhibit ...
    • Historical Note: The Past Thirty Years in Information Retrieval 

      Salton, Gerard (Cornell University, 1987-04)
      The documentation literature of the nineteen fifties is reviewed briefly, and some early text processing endeavors are discussed. Various predictions made in 1960 by Mooers about the creative role of computers in information ...
    • Howard Aiken's Children: The Harvard Computation Laboratory and its Students 

      Salton, Gerard (Cornell University, 1983-12)
      NO ABSTRACT SUPPLIED
    • Improving Retrieval Performance by Relevance Feedback 

      Salton, Gerard; Buckley, Chris (Cornell University, 1988-02)
      Relevance feedback is an automatic process, introduced over 20 years ago, designed to produce query formulations following an initial retrieval operation. The principal relevance feedback methods described over the years ...
    • The Information Revolution - Myth or Reality 

      Salton, Gerard (Cornell University, 1975-05)
      The early work in the design of automatic information systems, exemplified by the contributions of H. P. Luhn and others, now goes back nearly twenty years. It may be useful to ask whether a great deal has changed in ...
    • Interactive Information Retrieval 

      Salton, Gerard (Cornell University, 1969-08)
      The advent of time-sharing computer organizations and input-output console equipment has made it possible to experiment with interactive information handling methods in which the user takes on important functions both in ...
    • Length Normalization in Degraded Text Collections 

      Singhal, Amit; Salton, Gerard; Buckley, Chris (Cornell University, 1995-04)
      Optical character recognition (OCR) is the most commonly used technique to convert printed material into electronic form. Using OCR, large repositories of machine readable text can be created in a short time. An information ...
    • Mathematics and Information Retrieval 

      Salton, Gerard (Cornell University, 1978-01)
      The development of a given discipline in science and technology often depends on the availability of theories capable of describing the processes which control the field and of modelling the interactions between these ...
    • A New Comparison Between Conventional Indexing (MEDLARS) andAutomatic Text Processing (SMART) 

      Salton, Gerard (Cornell University, 1971-12)
      A new testing process is described designed to compare conventional retrieval (MEDLARS) and automatic text analysis methods (SMART). The results obtained with a collection of documents chosen independently of either SMART ...
    • A Note About Information Science Research 

      Salton, Gerard (Cornell University, 1984-09)
      This note deals with the relationship between information science research and practice. The impression that the field is moribund and that the research output is uniformly inferior is not supported by an examination of ...
    • A Note on Term Weighting and Text Matching 

      Salton, Gerard; Buckley, Chris (Cornell University, 1990-10)
      In information retrieval, it is not uncommon to be faced with large collections of unrestricted natural-language text. In such circumstances, the text analysis and retrieval operations must be based mainly on a study of ...
    • On the Application of Syntactic Methodologies in Automatic Text Analysis 

      Salton, Gerard; Smith, Maria (Cornell University, 1988-12)
      This study summarizes various linguistic approaches proposed for document analysis in information retrieval environments. Included are standard syntactic methods to generate complex content identifiers, and the use of ...
    • On the Automatic Generation of Content Links in Hypertext 

      Salton, Gerard; Buckley, Chris (Cornell University, 1989-04)
      Text structuring systems that provide links between text portions have been widely proposed as aids for text preparation and text manipulation. In principle, it is easy to follow available links between related text ...
    • On the Role of Words and Phrases in Automatic Text Analysis 

      Salton, Gerard; Wong, A. (Cornell University, 1975-06)
      One of the most crucial operations in automatic information retrieval is the assignment to written texts and documents of appropriate identifiers, capable of representing information content for search and retrieval ...
    • On the Specification of Term Values in Automatic Indexing 

      Salton, Gerard; Yang, C.S. (Cornell University, 1973-06)
      The existing practice in automatic indexing is reviewed, and it is shown that the standard theories for the specification of term values (or weights) are not adequate. New techniques are introduced for the assignment of ...
    • On the Use of Clustered File Organization in Information Search and Retrieval 

      Salton, Gerard; Araya, Jose E. (Cornell University, 1989-04)
      In modern retrieval environments, collection searches are normally conducted on-line under user control. Iterative collection searches can then be performed where tentative queries are initially processed, to be successively ...