用户名: 密码: 验证码:
A dynamic and parallel approach for repetitive prime labeling of XML with MapReduce
详细信息    查看全文
  • 作者:Jinhyun Ahn ; Dong-Hyuk Im ; Taewhi Lee ; Hong-Gee Kim
  • 关键词:Repetitive prime labeling ; MapReduce ; XML ; Tree
  • 刊名:The Journal of Supercomputing
  • 出版年:2017
  • 出版时间:February 2017
  • 年:2017
  • 卷:73
  • 期:2
  • 页码:810-836
  • 全文大小:
  • 刊物类别:Computer Science
  • 刊物主题:Programming Languages, Compilers, Interpreters; Processor Architectures; Computer Science, general;
  • 出版者:Springer US
  • ISSN:1573-0484
  • 卷排序:73
文摘
A massive amount of extensible markup language (XML) data from various areas is available on the Web. Answering structural queries against XML data is important, as it is the core of information retrieval systems for XML data. Labeling scheme has been suggested for rapid query processing of massive XML data. Interval-based, prefix-based, and prime number labeling scheme exist. Of these, the prime number labeling scheme has the advantage of query processing by arithmetic operations. Recently, the repetitive prime number labeling scheme was proposed; this scheme produces a smaller label size than conventional prime number labeling using prime numbers repetitively. However, a parallel algorithm for the repetitive prime number labeling scheme does not exist; therefore, this scheme is difficult to apply to massive XML data. In this paper, a dynamic and parallel approach of XML labeling algorithm that works with MapReduce is proposed for, particularly, the repetitive prime number labeling scheme. Two optimization techniques are devised: the label assignment order adjustment to further reduce the label size and the upper tree compressing technique to reduce the memory requirements during the labeling process. Experiments over real-world XML data confirmed that the techniques are effective than the previous works.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700