APEX

APEX: an adaptive path index for XML data. The emergence of the Web has increased interests in XML data. XML query languages such as XQuery and XPath use label paths to traverse the irregularly structured data. Without a structural summary and efficient indexes, query processing can be quite inefficient due to an exhaustive traversal on XML data. To overcome the inefficiency, several path indexes have been proposed in the research community. Traditional indexes generally record all label paths from the root element in XML data. Such path indexes may result in performance degradation due to large sizes and exhaustive navigations for partial matching path queries start with the self-or-descendent axis(”//”).In this paper, we propose APEX, an adaptive path index for XML data. APEX does not keep all paths starting from the root and utilizes frequently used paths to improve the query performance. APEX also has a nice property that it can be updated incrementally according to the changes of query workloads. Experimental results with synthetic and real-life data sets clearly confirm that APEX improves query processing cost typically 2 to 54 times better than the existing indexes, with the performance gap increasing with the irregularity of XML data.


References in zbMATH (referenced in 12 articles )

Showing results 1 to 12 of 12.
Sorted by year (citations)

  1. Lee, Chun-Hee; Chung, Chin-Wan: Efficient search in graph databases using cross filtering (2014)
  2. Hsu, Wen-Chiao; Liao, I-En: CIS-X: a compacted indexing scheme for efficient query evaluation of XML documents (2013)
  3. Li, Guoliang; Feng, Jianhua; Wang, Jianyong; Zhou, Lizhu: Incremental sequence-based frequent query pattern mining from XML queries (2009) ioport
  4. Arion, Andrei; Bonifati, Angela; Manolescu, Ioana; Pugliese, Andrea: Path summaries and path partitioning in modern XML databases (2008) ioport
  5. May, Wolfgang; Behrends, Erik; Fritzen, Oliver: Integrating and querying distributed XML data via XLink (2008) ioport
  6. Ng, Patrick K. L.; Ng, Vincent T. Y.: Rrsi: Indexing XML data for proximity twig queries (2008) ioport
  7. Chung, Yon Dohn; Lee, Ji Yeon: An indexing method for wireless broadcast XML data (2007) ioport
  8. Gorelov, S. S.: Optimal schema hierarchies in searching semistructured databases by conjunctive regular path queries (2006)
  9. Wong, Kam-Fai; Yu, Jeffrey Xu; Tang, Nan: Answering XML queries using path-based indexes: a survey (2006) ioport
  10. Schenkel, Ralf; Theobald, Anja; Weikum, Gerhard: Semantic similarity search on semistructured data with the XXL search engine (2005) ioport
  11. Schenkel, Ralf; Theobald, Anja; Weikum, Gerhard: Semantic similarity search on semistructured data with the XXL search engine (2005) ioport
  12. Chen, Zhiyuan; Li, Chen; Pei, Jian; Tao, Yufei; Wang, Haixun; Wang, Wei; Yang, Jiong; Yang, Jun; Zhang, Donghui: Recent progress on selected topics in database research. -- A report by nine young Chinese researchers working in the United States. (2003)