Cornell University
Library
Cornell UniversityLibrary

eCommons

Help
Log In(current)
  1. Home
  2. Cornell Computing and Information Science
  3. Computing and Information Science
  4. Computing and Information Science Technical Reports
  5. Efficient Keyword Search over Virtual XML Views

Efficient Keyword Search over Virtual XML Views

File(s)
TR2007-2077.pdf (699.09 KB)
Permanent Link(s)
https://hdl.handle.net/1813/5768
Collections
Computing and Information Science Technical Reports
Author
Shao, Feng
Guo, Lin
Botev, Chavdar
Bhaskar, Anand
Chettiah, Muthiah
Yang, Fan
Shanmugasundaram, Jayavel
Abstract

Emerging applications such as personalized portals, enterprise search and web integration systems often require keyword search over semi-structured views. However, traditional information retrieval techniques are likely to be expensive in this context because they rely on the assumption that the set of documents being searched is materialized. In this paper, we present a system architecture and algorithm that can efficiently evaluate keyword search queries over virtual (unmaterialized) XML views. An interesting aspect of our approach is that it exploits indices present on the base data and thereby avoids materializing large parts of the view that are not relevant to the query results. Another feature of the algorithm is that by solely using indices, we can still score the results for queries over the virtual view, and the resulting scores and rank order are the same as if the view was materialized. Our performance evaluation using the INEX data set in the Quark open-source XML database system indicates that the proposed approach is scalable and efficient.

Date Issued
2007-03-22
Publisher
Cornell University
Keywords
computer science
•
technical report
Previously Published as
http://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cis/TR2007-2077
Type
technical report

Site Statistics | Help

About eCommons | Policies | Terms of use | Contact Us

copyright © 2002-2026 Cornell University Library | Privacy | Web Accessibility Assistance