Record Details

DSpace at IIT Bombay

Metadata

 
Field Value
 
Title FleXPath: flexible structure and full-text querying for XML
 
Names AMER-YAHIA, SIHEM; LAKSHMANAN, LAKS VS; PANDIT, SHASHANK
Date Issued 2004
Abstract Querying XML data is a well-explored topic with powerful database-style query languages such as XPath and XQuery set to become W3C standards. An equally compelling paradigm for querying XML documents is full-text search on textual content. In this paper, we study fundamental challenges that arise when we try to integrate these two querying paradigms. While keyword search is based on approximate matching, XPath has exact match semantics. We address this mismatch by considering queries on structure as a "template", and looking for answers that best match this template and the full-text search. To achieve this, we provide an elegant definition of relaxation on structure and define primitive operators to span the space of relaxations. Query answering is now based on ranking potential answers on structural and full-text search conditions. We set out certain desirable principles for ranking schemes and propose natural ranking schemes that adhere to these principles. We develop efficient algorithms for answering top-K queries and discuss results from a comprehensive set of experiments that demonstrate the utility and scalability of the proposed framework and algorithms.
Topic Query Languages
Identifier Proceedings of the ACM SIGMOD International Conference on Management of Data, Paris, France, 13-18 June 2004, 83-94