JavaScript is disabled for your browser. Some features of this site may not work without it.
On the Completeness of Full-Text Search Languages for XML
dc.contributor.author | Botev, Chavdar | en_US |
dc.contributor.author | Amer-Yahia, Sihem | en_US |
dc.contributor.author | Shanmugasundaram, Jayavel | en_US |
dc.date.accessioned | 2007-04-04T19:28:30Z | |
dc.date.available | 2007-04-04T19:28:30Z | |
dc.date.issued | 2003-12-14 | en_US |
dc.identifier.citation | http://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cis/TR2003-1917 | en_US |
dc.identifier.uri | https://hdl.handle.net/1813/5629 | |
dc.description.abstract | We study formal properties of full-text search languages for XML. Our main contribution is the development of a formal model for full-text search based on the positions of tokens in XML nodes. Building on this model, we define a full-text calculus based on first-order logic, and a full-text algebra based on the relational algebra. We show that the full-text calculus and algebra are equivalent even in the presence of arbitrary position-based predicates, such as distance predicates and phrase matching. This suggests a notion of completeness for full-text languages. None of the full-text search languages that we are aware of are complete under the above characterization. We propose a new full-text language that is complete and naturally generalizes existing full-text languages. Our formalization in terms of the relational model can also serve as the basis for (a) joint optimization of structured and full-text search queries, and (b) ranking full-text search query results by leveraging existing work on the probabilistic relational model. | en_US |
dc.format.extent | 152982 bytes | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en_US | en_US |
dc.publisher | Cornell University | en_US |
dc.subject | computer science | en_US |
dc.subject | technical report | en_US |
dc.title | On the Completeness of Full-Text Search Languages for XML | en_US |
dc.type | technical report | en_US |