Cloud; Data consistency; Databases; NoSQL; Performance; Cassandras; Cloud-based; Consistency level; Database management; Management systems; MongoDB; Systems performance; Software; Computer Networks and Communications
Résumé :
[en] When using database management systems (DBMSs), it is common to distribute instance replicas across multiple locations for disaster recovery and scaling purposes. To efficiently geo-replicate data, it is crucial to ensure the data and its replicas remain consistent with the same and the most up-to-date data. However, DBMSs’ inner characteristics and external factors, such as the replication strategy and network latency, can affect system performance when dealing with data replication, especially when the replicas are deployed far apart from the others. Thus, it is essential to comprehend how achieving high data consistency levels in geo-replicated systems can impact systems performance. This work analyzes various data consistency settings for the widely used NoSQL DBMSs, namely MongoDB, Redis, and Cassandra. The analysis is based on real-world experiments in which DBMS nodes are deployed on cloud platforms in different locations, considering single and multiple region deployments. Based on the results of the experiments, we provide a comprehensive analysis regarding the system throughput and response time when executing reading and writing operations, pointing out scenarios where each DBMS could be better employed. Some of our findings include, for instance, that opting for strong data consistency significantly impacts Cassandra’s reading operations in the single-region deployment, while MongoDB writing operations are most affected in a multi-region scenario. Additionally, all of these DBMSs exhibit statistically significant variations across all scenarios in the multi-region setup when the data consistency is switched from weak to stronger level.
Disciplines :
Sciences informatiques
Auteur, co-auteur :
Ferreira, Saulo; Universidade Federal Rural de Pernambuco, Recife, Brazil
RODRIGUES DE MENDONÇA NETO, Júlio ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > CritiX
Nogueira, Bruno; Universidade Federal de Alagoas, Maceió, Brazil
Tiengo, Willy; Universidade Federal de Alagoas, Maceió, Brazil
Andrade, Ermeson; Universidade Federal Rural de Pernambuco, Recife, Brazil
Co-auteurs externes :
yes
Langue du document :
Anglais
Titre :
Impacts of data consistency levels in cloud-based NoSQL for data-intensive applications
Date de publication/diffusion :
décembre 2024
Titre du périodique :
Journal of Cloud Computing
eISSN :
2192-113X
Maison d'édition :
Springer Science and Business Media Deutschland GmbH
R.D. Ab Rashid Dar Survey on scalability in cloud environment Int J Adv Res Comput Eng Technol 2016 5 7 2124 2128
K. Nadiminti M.D. De Assunçao R. Buyya Distributed systems and recent innovations: Challenges and benefits InfoNet Mag 2006 16 3 1 5
Abualkishik AZ, Alwan AA, Gulzar Y (2020) Disaster recovery in cloud computing systems: An overview. Int J Adv Comput Sci Appl 11(9):702–710
Ledmi A, Bendjenna H, Hemam SM (2018) Fault tolerance in distributed systems: A survey. In: 2018 3rd International Conference on Pattern Analysis and Intelligent Systems (PAIS), IEEE, pp 1–5
Abd Alnabe N, Zeebaree SR (2024) Distributed systems for real-time computing in cloud environment: A review of low-latency and time sensitive applications. Indones J Comput Sci 13(2):2549–7286
Y. Mansouri V. Prokhorenko M.A. Babar An automated implementation of hybrid cloud for performance evaluation of distributed databases J Netw Comput Appl 2020 167 102 740
W. Al Shehri Cloud database database as a service Int J Database Manag Syst 2013 5 2 1 10.5121/ijdms.2013.5201
Shapiro M, Sutra P (2018) Database consistency models. arXiv preprint arXiv:1804.00914
Gorbenko A, Romanovsky A, Tarasyuk O (2020) Interplaying cassandra nosql consistency and performance: A benchmarking approach. In: Dependable Computing-EDCC 2020 Workshops: AI4RAILS, DREAMS, DSOGRI, SERENE 2020, Munich, Germany, September 7, 2020, Proceedings 16, Springer, pp 168–184
Gomes C, de O Junior MN, Nogueira B, Maciel P, Tavares E (2023) Nosql-based storage systems: influence of consistency on performance, availability and energy consumption. J Supercomput 79(18):21424–21448
H. Wada A.D. Fekete L. Zhao K. Lee A. Liu Data consistency properties and the trade-offs in commercial cloud storage: the consumers’ perspective CIDR 2011 11 134 143
M. Diogo B. Cabral J. Bernardino Consistency models of nosql databases Futur Internet 2019 11 2 43 10.3390/fi11020043
C. Strauch U.L.S. Sites W. Kriha Nosql databases Lect Notes Stuttgart Media Univ 2011 20 24 79
Moniruzzaman A, Hossain SA (2013) Nosql database: New era of databases for big data analytics-classification, characteristics and comparison. arXiv preprint arXiv:1307.0191
DB-Engines (2024) DB-Engines Ranking. https://db-engines.com/en/ranking. Accessed 20 Jan 2024
E.A. Brewer Towards robust distributed systems PODC, Portland, OR 2000 7 343477 343502
S. Gilbert N. Lynch Perspectives on the cap theorem Computer 2012 45 2 30 36 10.1109/MC.2011.389
E. Hewitt Cassandra: the definitive guide 2010 Newton O’Reilly Media Inc
P. Membrey E. Plugge T. Hawkins D. Hawkins The definitive guide to MongoDB: the noSQL database for cloud and desktop computing 2010 New York Springer
Sanfilippo S, Noordhuis P (2009) Redis. https://redis.io. Accessed 10 June 2024
Chen S, Tang X, Wang H, Zhao H, Guo M (2016) Towards scalable and reliable in-memory storage system: A case study with redis. In: 2016 IEEE Trustcom/BigDataSE/ISPA, IEEE, pp 1660–1667
Han J, Haihong E, Le G, Du J (2011) Survey on nosql database. In: 2011 6th international conference on pervasive computing and applications, IEEE, pp 363–366
Mohamed MA, Altrafi OG, Ismail MO (2014) Relational vs. nosql databases: A survey. Int J Comput Inf Technol 3(03):598–601
W. Khan T. Kumar C. Zhang K. Raj A.M. Roy B. Luo Sql and nosql database software architecture performance analysis and assessments—a systematic literature review Big Data Cogn Comput 2023 7 2 97 10.3390/bdcc7020097
M. Abu Kausar M. Nasar A. Soosaimanickam A study of performance and comparison of nosql databases: Mongodb, cassandra, and redis using ycsb Indian J Sci Technol 2022 15 31 1532 1540 10.17485/IJST/v15i31.1352
Gandini A, Gribaudo M, Knottenbelt WJ, Osman R, Piazzolla P (2014) Performance evaluation of nosql databases. In: Computer Performance Engineering: 11th European Workshop, EPEW 2014, Florence, Italy, September 11-12, 2014. Proceedings 11, Springer, pp 16–29
V. Abramova J. Bernardino P. Furtado Which nosql database? a performance overview Open J Databases (OJDB) 2014 1 2 17 24
Wang H, Li J, Zhang H, Zhou Y (2014) Benchmarking replication and consistency strategies in cloud serving databases: Hbase and cassandra. In: Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware, Springer, pp 71–82
Gomes C, Borba E, Tavares E, Junior MNdO (2019) Performability model for assessing nosql dbms consistency. In: 2019 IEEE International Systems Conference (SysCon), IEEE, pp 1–6
Haughian G, Osman R, Knottenbelt WJ (2016) Benchmarking replication in cassandra and mongodb nosql datastores. In: International Conference on Database and Expert Systems Applications, Springer, pp 152–166
Ferreira S, Andrade E, Mendonça J (2021) Uma abordagem experimental para avaliar os níveis de consistência do banco de dados nosql cassandra. In: Anais do XXII Simpósio em Sistemas Computacionais de Alto Desempenho, SBC, pp 156–167
Heyman J, Byström C, Hamrén J, Heyman H (2012) Locust.io. https://locust.io/. Accessed 10 June 2024
Pradeep S, Sharma YK (2019) A pragmatic evaluation of stress and performance testing technologies for web based applications. In: 2019 Amity International Conference on Artificial Intelligence (AICAI), IEEE, pp 399–403
D.C. Montgomery Design and analysis of experiments 2017 Hoboken John wiley & sons
R. DIAgostino An omnibus test of normality for moderate and large sample sizes Biometrika 1971 58 34 1 348 281300
Kotz S, Johnson NL (eds) (1992) The Probable Error of a Mean, Springer New York, New York, pp 33–57. https://doi.org/10.1007/978-1-4612-4380-9_4
McKnight PE, Najab J (2010) Mann-whitney u test. The Corsini encyclopedia of psychology, Wiley, Hoboken, New Jersey, p 1
Redis (2024) Redis Documentation. https://redis.io/docs/latest/. Accessed 17 June 2024
Cooper BF, Silberstein A, Tam E, Ramakrishnan R, Sears R (2010) Benchmarking cloud serving systems with YCSB. In: Proceedings of the 1st ACM symposium on Cloud computing, Association for Computing Machinery, New York, New York, pp 143–154