eCommons

 

A Note on Term Weighting and Text Matching

dc.contributor.authorSalton, Gerarden_US
dc.contributor.authorBuckley, Chrisen_US
dc.date.accessioned2007-04-23T17:50:30Z
dc.date.available2007-04-23T17:50:30Z
dc.date.issued1990-10en_US
dc.description.abstractIn information retrieval, it is not uncommon to be faced with large collections of unrestricted natural-language text. In such circumstances, the text analysis and retrieval operations must be based mainly on a study of the text collections actually under construction. Two main operations are of interest: a text analysis operation designed to assign content identifiers to the stored texts, and a text comparison system designed to identify texts covering particular subject areas. In the present note, some details are given concerning the usefulness of term weighting systems for the content analysis of natural-language texts, and of text matching strategies designed to identify relevant text items in answer to available search requests. A sample collection of electronic mail messages is used for experimental purposes.en_US
dc.format.extent930466 bytes
dc.format.extent213501 bytes
dc.format.mimetypeapplication/pdf
dc.format.mimetypeapplication/postscript
dc.identifier.citationhttp://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cs/TR90-1166en_US
dc.identifier.urihttps://hdl.handle.net/1813/7006
dc.language.isoen_USen_US
dc.publisherCornell Universityen_US
dc.subjectcomputer scienceen_US
dc.subjecttechnical reporten_US
dc.titleA Note on Term Weighting and Text Matchingen_US
dc.typetechnical reporten_US

Files

Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
90-1166.pdf
Size:
908.66 KB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
90-1166.ps
Size:
208.5 KB
Format:
Postscript Files