DSpace at IIT Bombay
View Archive InfoMetadata
Field | Value |
Title | FleXPath: flexible structure and full-text querying for XML |
Names |
AMER-YAHIA, SIHEM
LAKSHMANAN, LAKS VS SHASHANK, PANDIT |
Date Issued | 2004 (iso8601) |
Abstract | Querying XML data is a well-explored topic with powerful database-style query languages such as XPath and XQuery set to become W3C standards. An equally compelling paradigm for querying XML documents is full-text search on textual content. In this paper, we study fundamental challenges that arise when we try to integrate these two querying paradigms. While keyword search is based on approximate matching, XPath has exact match semantics. We address this mismatch by considering queries on structure as a "template", and looking for answers that best match this template and the full-text search. To achieve this, we provide an elegant definition of relaxation on structure and define primitive operators to span the space of relaxations. Query answering is now based on ranking potential answers on structural and full-text search conditions. We set out certain desirable principles for ranking schemes and propose natural ranking schemes that adhere to these principles. We develop efficient algorithms for answering top-K queries and discuss results from a comprehensive set of experiments that demonstrate the utility and scalability of the proposed framework and algorithms. |
Topic | Query Languages |
Identifier | Proceedings of the ACM SIGMOD International Conference on Management of Data, Paris, France, 13-18 June 2004, 83-94 |