eCommons

 

Automatic Structuring and Retrieval of Large Text Files

dc.contributor.author: Salton, Gerard (en_US)
dc.contributor.author: Allan, James (en_US)
dc.contributor.author: Buckley, Chris (en_US)
dc.date.accessioned: 2007-04-23T17:59:21Z
dc.date.available: 2007-04-23T17:59:21Z
dc.date.issued: 1992-06 (en_US)
dc.description.abstract: In many operational environments, large text files must be processed covering a wide variety of different topic areas. Aids must then be provided to the user that permit collection browsing and make it possible to locate particular items on demand. The conventional text analysis methods based on preconstructed knowledge-bases and other vocabulary-control tools are difficult to apply when the subject coverage is unrestricted. An alternative approach, applicable to text collections in any subject area, is introduced which uses the document collections themselves as a basis for the text analysis, together with sophisticated text matching operations carried out at several levels of detail. Methods are described for relating semantically similar pieces of text, and for using the resulting hypertext structures for collection browsing and information retrieval. (en_US)
dc.format.extent: 3835074 bytes
dc.format.extent: 846725 bytes
dc.format.mimetype: application/pdf
dc.format.mimetype: application/postscript
dc.identifier.citation: http://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cs/TR92-1286 (en_US)
dc.identifier.uri: https://hdl.handle.net/1813/7126
dc.language.iso: en_US (en_US)
dc.publisher: Cornell University (en_US)
dc.subject: computer science (en_US)
dc.subject: technical report (en_US)
dc.title: Automatic Structuring and Retrieval of Large Text Files (en_US)
dc.type: technical report (en_US)

Files

Original bundle
Name: 92-1286.pdf
Size: 3.66 MB
Format: Adobe Portable Document Format
Name: 92-1286.ps
Size: 826.88 KB
Format: Postscript Files