Benchmark Cassandra and MongoDB with YCSB Using Different Amount of Shards, Replication Factors and Throughputs

碩士 === 國立彰化師範大學 === 資訊管理學系所 === 105 === With the development of the information and communication technology, people can collect data easier than before. This evolution also speeds up the growth of the amount of data that can be used for analysis. This movement leads us into the big data era. Howeve...

Full description

Bibliographic Details
Main Authors: Chou, Li-Heng, 周立珩
Other Authors: Wu, Tung-Kuang
Format: Others
Language:zh-TW
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/ux6xy8
id ndltd-TW-105NCUE5396040
record_format oai_dc
spelling ndltd-TW-105NCUE53960402019-05-16T00:00:23Z http://ndltd.ncl.edu.tw/handle/ux6xy8 Benchmark Cassandra and MongoDB with YCSB Using Different Amount of Shards, Replication Factors and Throughputs 使用YCSB分析在不同分片數、副本數與吞吐量下MongoDB與Cassandra的效能差異 Chou, Li-Heng 周立珩 碩士 國立彰化師範大學 資訊管理學系所 105 With the development of the information and communication technology, people can collect data easier than before. This evolution also speeds up the growth of the amount of data that can be used for analysis. This movement leads us into the big data era. However, due to the limitations of relational database, it can no longer meet the current big data requirements. Therefore, a new type of database model, NoSQL, has become the focus of various IT-related industries in this big data trend. When it comes to the choice of a specific NoSQL database, corporations must evaluate its characheristic with respect to their needs carefully. Accordingly, in this study we aim to conduct an all aspects evaluation of the top two NoSQL databases, MongoDB and Cassandra, so that related corporations may have better insight into these two systems. YCSB (Yahoo! Cloud Serving Benchmark) is adopted as the benchmarking program suit, to evaluate the two systems with different amont of shards, replication factors and throughputs. Our results show that Cassandra significantly outperformed MongoDB in most of our experiments. Wu, Tung-Kuang 吳東光 2017 學位論文 ; thesis 147 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立彰化師範大學 === 資訊管理學系所 === 105 === With the development of the information and communication technology, people can collect data easier than before. This evolution also speeds up the growth of the amount of data that can be used for analysis. This movement leads us into the big data era. However, due to the limitations of relational database, it can no longer meet the current big data requirements. Therefore, a new type of database model, NoSQL, has become the focus of various IT-related industries in this big data trend. When it comes to the choice of a specific NoSQL database, corporations must evaluate its characheristic with respect to their needs carefully. Accordingly, in this study we aim to conduct an all aspects evaluation of the top two NoSQL databases, MongoDB and Cassandra, so that related corporations may have better insight into these two systems. YCSB (Yahoo! Cloud Serving Benchmark) is adopted as the benchmarking program suit, to evaluate the two systems with different amont of shards, replication factors and throughputs. Our results show that Cassandra significantly outperformed MongoDB in most of our experiments.
author2 Wu, Tung-Kuang
author_facet Wu, Tung-Kuang
Chou, Li-Heng
周立珩
author Chou, Li-Heng
周立珩
spellingShingle Chou, Li-Heng
周立珩
Benchmark Cassandra and MongoDB with YCSB Using Different Amount of Shards, Replication Factors and Throughputs
author_sort Chou, Li-Heng
title Benchmark Cassandra and MongoDB with YCSB Using Different Amount of Shards, Replication Factors and Throughputs
title_short Benchmark Cassandra and MongoDB with YCSB Using Different Amount of Shards, Replication Factors and Throughputs
title_full Benchmark Cassandra and MongoDB with YCSB Using Different Amount of Shards, Replication Factors and Throughputs
title_fullStr Benchmark Cassandra and MongoDB with YCSB Using Different Amount of Shards, Replication Factors and Throughputs
title_full_unstemmed Benchmark Cassandra and MongoDB with YCSB Using Different Amount of Shards, Replication Factors and Throughputs
title_sort benchmark cassandra and mongodb with ycsb using different amount of shards, replication factors and throughputs
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/ux6xy8
work_keys_str_mv AT chouliheng benchmarkcassandraandmongodbwithycsbusingdifferentamountofshardsreplicationfactorsandthroughputs
AT zhōulìháng benchmarkcassandraandmongodbwithycsbusingdifferentamountofshardsreplicationfactorsandthroughputs
AT chouliheng shǐyòngycsbfēnxīzàibùtóngfēnpiànshùfùběnshùyǔtūntǔliàngxiàmongodbyǔcassandradexiàonéngchàyì
AT zhōulìháng shǐyòngycsbfēnxīzàibùtóngfēnpiànshùfùběnshùyǔtūntǔliàngxiàmongodbyǔcassandradexiàonéngchàyì
_version_ 1719157584223535104