Enhancing the Estimation Quality of Element-centered XML Summarization Methods

Abstract

An XML summary should enable cardinality estimations of different kinds on an XML document to flexibly support query optimization for languages such as XPath or XQuery. In contrast to conventional methods, which typically emulate the document structure and record path-oriented statistics for it, element-centered XML summarization methods collect statistical information for document nodes and their axis relationships and aggregate it separately for each distinct element/attribute name. This approach has already partially proven its superiority in quality, space consumption, and evaluation performance. Surprisingly, this kind of inversion seems to offer more service capability than conventional approaches: it is not confined to cardinality estimation for the child and descendant axes, but also allows the parent and ancestor axes to be approximated. Therefore, we refined and extended element-centered XML summarization methods to capture more statistical information, and we propose new estimation procedures. We tested our ideas on a set of documents with largely varying characteristics.

Table of Contents

Abstract
 1. Introduction
 2. Basic Concepts and Definitions
 3. A Brief Look at EXsum
 4. Extending EXsum
  4.1. Building Algorithm
  4.2. Heuristics to Support Estimation of Longer Path Expressions
 5. Empirical Evaluation
  5.1. Timing Analysis
  5.2. Sizing Analysis
  5.3. Estimation Quality
 6. Conclusion
 References

Author Information

  • José de Aguiar Moraes Filho University of Kaiserslautern
  • Theo Härder University of Kaiserslautern
  • Caetano Sauer University of Kaiserslautern
