Graph DBs vs. Column-Oriented Stores: A Pure Performance Comparison
  • Keywords: NoSQL databases; Hadoop; HBase; Neo4j; Graph databases; Distributed systems
  • Publication title: Lecture Notes in Computer Science
  • Publication year: 2016
  • Volume: 9511
  • Issue: 1
  • Pages: 62-74
  • Full-text size: 600 KB
  • References: 1. Angles, R., Gutierrez, C.: Survey of graph database models. ACM Comput. Surv. 40(1), 1:1–1:39 (2008)
    2. Brewer, E.: CAP twelve years later: how the “rules” have changed. Computer 45(2), 23–29 (2012)
    3. Cai, L., Huang, S., Chen, L., Zheng, Y.: Performance analysis and testing of HBase based on its architecture. In: 2013 IEEE/ACIS 12th International Conference on Computer and Information Science (ICIS), pp. 353–358, June 2013
    4. Chang, F., Dean, J., Ghemawat, S., Hsieh, W.C., Wallach, D.A., Burrows, M., Chandra, T., Fikes, A., Gruber, R.E.: Bigtable: a distributed storage system for structured data. In: Proceedings of the 7th Symposium on Operating Systems Design and Implementation, OSDI 2006, pp. 205–218. USENIX Association, Berkeley, CA, USA (2006)
    5. DeCandia, G., Hastorun, D., Jampani, M., Kakulapati, G., Lakshman, A., Pilchin, A., Sivasubramanian, S., Vosshall, P., Vogels, W.: Dynamo: Amazon’s highly available key-value store. SIGOPS Oper. Syst. Rev. 41(6), 205–220 (2007)
    6. George, L.: HBase: The Definitive Guide. O’Reilly Media Inc., Sebastopol (2011)
    7. Holzschuher, F., Peinl, R.: Performance of graph query languages: comparison of Cypher, Gremlin and native access in Neo4j. In: Proceedings of the Joint EDBT/ICDT 2013 Workshops, EDBT 2013, NY, USA, pp. 195–204. ACM, New York (2013)
    8. Kostylev, E.V., Reutter, J.L., Vrgoc, D.: Containment of data graph queries. In: ICDT, pp. 131–142 (2014)
    9. Chodorow, K., Dirolf, M.: MongoDB: The Definitive Guide. O’Reilly Media Inc., Sebastopol (2010)
    10. Lakshman, A., Malik, P.: Cassandra: a decentralized structured storage system. ACM SIGOPS Oper. Syst. Rev. 44(2), 35–40 (2010)
    11. Neo4j.org: Neo4j - the world’s leading graph database. http://www.neo4j.org/, Accessed on 16 June 2014
    12. Nishimura, S., Das, S., Agrawal, D., Abbadi, A.: MD-HBase: a scalable multi-dimensional data infrastructure for location aware services. In: 2011 12th IEEE International Conference on Mobile Data Management (MDM), vol. 1, pp. 7–16, June 2011
    13. Robinson, I., Webber, J., Eifrem, E.: Graph Databases. O’Reilly Media Inc., Sebastopol (2013)
    14. Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The Hadoop distributed file system. In: 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), pp. 1–10, May 2010
    15. Thusoo, A., Sarma, J.S., Jain, N., Shao, Z., Chakka, P., Anthony, S., Liu, H., Wyckoff, P., Murthy, R.: Hive: a warehousing solution over a map-reduce framework. Proc. VLDB Endow. 2(2), 1626–1629 (2009)
    16. Vicknair, C., Macias, M., Zhao, Z., Nan, X., Chen, Y., Wilkins, D.: A comparison of a graph database and a relational database: a data provenance perspective. In: Proceedings of the 48th Annual Southeast Regional Conference, ACM SE 2010, NY, USA, pp. 42:1–42:6. ACM, New York (2010)
    17. White, T.: Hadoop: The Definitive Guide, 3rd edn. O’Reilly Media Inc., Sebastopol (2012)
    18. Wood, P.T.: Query languages for graph databases. SIGMOD Rec. 41(1), 50–60 (2012)
  • Author affiliations: Marios Kendea (17)
    Vassiliki Gkantouna (17)
    Angeliki Rapti (17)
    Spyros Sioutas (18)
    Giannis Tzimas (19)
    Dimitrios Tsolis (20)

    17. Computer Engineering and Informatics Department, University of Patras, 26504, Patras, Greece
    18. Department of Informatics, Ionian University, 49100, Corfu, Greece
    19. Computer and Informatics Engineering Department, Technological Educational Institute of Western Greece, 26334, Patras, Greece
    20. Department of Cultural Heritage, Management and New Technologies, University of Patras, 26504, Patras, Greece
  • Series title: Algorithmic Aspects of Cloud Computing
  • ISBN: 978-3-319-29919-8
  • Publication category: Computer Science
  • Publication subjects: Artificial Intelligence and Robotics
    Computer Communication Networks
    Software Engineering
    Data Encryption
    Database Management
    Computation by Abstract Devices
    Algorithm Analysis and Problem Complexity
  • Publisher: Springer Berlin / Heidelberg
  • ISSN: 1611-3349
Abstract
Cloud computing has greatly changed the way information is stored and applications are run. For one or more clusters to work as a cloud, a middleware framework such as Apache Hadoop [17] is needed to provide reliability, scalability and distributed computing. Once the infrastructure has been established, a software framework can be installed on top of it to serve as the connection to the applications developed by users. The software, in this regard, is a NoSQL database. This paper deals with the problem of searching data in some widespread NoSQL databases used in cloud computing. Two categories of NoSQL databases are compared: a column-oriented key-value store, HBase [6], and a highly available graph database, Neo4j [11]. HBase is a distributed, scalable storage system that runs on top of HDFS and has been designed based on Google’s Bigtable [4]. Neo4j has been designed and developed to be a reliable database optimized for graph structures instead of tables; it is a robust, scalable, high-performance and highly available database that supports ACID transactions and queries written in the Cypher language. The aim of this paper is to create a novel system that decides when a query should be sent for execution to a key-value store or to a graph database. To that end, an experimental pure performance comparison has been made between Apache HBase and Neo4j for a variety of queries, which were programmed using each system’s API and the Java language.
