Show simple item record

dc.contributor.authorBotev, Chavdaren_US
dc.contributor.authorAmer-Yahia, Sihemen_US
dc.contributor.authorShanmugasundaram, Jayavelen_US
dc.date.accessioned2007-04-04T19:28:30Z
dc.date.available2007-04-04T19:28:30Z
dc.date.issued2003-12-14en_US
dc.identifier.citationhttp://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cis/TR2003-1917en_US
dc.identifier.urihttps://hdl.handle.net/1813/5629
dc.description.abstractWe study formal properties of full-text search languages for XML. Our main contribution is the development of a formal model for full-text search based on the positions of tokens in XML nodes. Building on this model, we define a full-text calculus based on first-order logic, and a full-text algebra based on the relational algebra. We show that the full-text calculus and algebra are equivalent even in the presence of arbitrary position-based predicates, such as distance predicates and phrase matching. This suggests a notion of completeness for full-text languages. None of the full-text search languages that we are aware of are complete under the above characterization. We propose a new full-text language that is complete and naturally generalizes existing full-text languages. Our formalization in terms of the relational model can also serve as the basis for (a) joint optimization of structured and full-text search queries, and (b) ranking full-text search query results by leveraging existing work on the probabilistic relational model.en_US
dc.format.extent152982 bytes
dc.format.mimetypeapplication/pdf
dc.language.isoen_USen_US
dc.publisherCornell Universityen_US
dc.subjectcomputer scienceen_US
dc.subjecttechnical reporten_US
dc.titleOn the Completeness of Full-Text Search Languages for XMLen_US
dc.typetechnical reporten_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Statistics