Automatic Content Analysis in Information Retrieval
The content analysis problem is first introduced, and some of the standard analysis procedures used in information retrieval are reviewed. The principal content analysis methods incorporated into the automatic SMART document retrieval system are then briefly examined and their effectiveness for information retrieval is discussed. Included in the system are word stem matching procedures, synonym recognition, phrase recognition, syntactic analysis, statistical term association techniques, and hierarchical expansion methods.
computer science; technical report
Previously Published As