CFP last date
20 February 2025
Reseach Article

Empirical Insights into Replication Models for Distributed Database Environments

by Theophilus Acquah, Richard Amankwah, Bright Appiah
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 53
Year of Publication: 2024
Authors: Theophilus Acquah, Richard Amankwah, Bright Appiah
10.5120/ijca2024924212

Theophilus Acquah, Richard Amankwah, Bright Appiah . Empirical Insights into Replication Models for Distributed Database Environments. International Journal of Computer Applications. 186, 53 ( Dec 2024), 27-34. DOI=10.5120/ijca2024924212

@article{ 10.5120/ijca2024924212,
author = { Theophilus Acquah, Richard Amankwah, Bright Appiah },
title = { Empirical Insights into Replication Models for Distributed Database Environments },
journal = { International Journal of Computer Applications },
issue_date = { Dec 2024 },
volume = { 186 },
number = { 53 },
month = { Dec },
year = { 2024 },
issn = { 0975-8887 },
pages = { 27-34 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number53/empirical-insights-into-replication-models-for-distributed-database-environments/ },
doi = { 10.5120/ijca2024924212 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-12-07T02:20:23.458771+05:30
%A Theophilus Acquah
%A Richard Amankwah
%A Bright Appiah
%T Empirical Insights into Replication Models for Distributed Database Environments
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 53
%P 27-34
%D 2024
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In the age of big data and cloud computing, replication in distributed databases has become critical to assuring data availability, fault tolerance, and performance optimization. Businesses in a variety of industries rely on effective replication systems to ensure uninterrupted access to essential data, even in the event of node failure or network partitions. Despite major advances in network technology and distributed systems, replication remains a difficult and multidimensional problem. This paper seeks to provide a review of the present status of replication in distributed databases, focusing on various replication strategies, their inherent problems, and best practices for implementation. The goals of this research are threefold: first, to define the various replication models and their trade-offs in terms of consistency, availability, and partition tolerance; second, to examine the performance and scalability of these models using empirical studies and real-world case studies; and third, to make recommendations for optimizing replication strategies in distributed systems. The Methodology consists of a detailed literature assessment of current research published within the last five years, as well as an empirical investigation of existing replication models. This study finishes by identifying opportunities for further research, such as developing more efficient replication protocols, integrating sharding with replication, and investigating decentralized replication models. Through this extensive investigation, we hope to contribute to the ongoing efforts to optimize replication in distributed databases, ensuring that they match the changing demands of modern applications.

References
  1. A. Noraziah, A. A. C. Fauzi, S. H. S. A. Ubaidillah, B. Alkazemi, and J. B. Odili, “BVAGQ-AR for Fragmented Database Replication Management,” IEEE Access, vol. 9, pp. 56168–56177, 2021, doi: 10.1109/ACCESS.2021.3065944.
  2. T. Taipalus, “Database management system performance comparisons: A systematic literature review,” Journal of Systems and Software, vol. 208, Feb. 2024, doi: 10.1016/j.jss.2023.111872.
  3. P. Peng, E. Soljanin, and P. Whiting, “Diversity vs. Parallelism in Distributed Computing with Redundancy,” in IEEE International Symposium on Information Theory - Proceedings, Institute of Electrical and Electronics Engineers Inc., Jun. 2020, pp. 257–262. doi: 10.1109/ISIT44484.2020.9174030.
  4. W. Yu and C. L. Ignat, “Conflict-Free Replicated Relations for Multi-Synchronous Database Management at Edge,” in Proceedings - 2020 IEEE International Conference on Smart Data Services, SMDS 2020, Institute of Electrical and Electronics Engineers Inc., Oct. 2020, pp. 113–121. doi: 10.1109/SMDS49396.2020.00021.
  5. S. Lima et al., “Optimal Causal Access in Geo-Replicated Storage Systems,” 2022, doi: 10.21203/rs.3.rs-2051314/v1.
  6. J. Liu et al., “A Low-Cost Multi-Failure Resilient Replication Scheme for High-Data Availability in Cloud Storage,” IEEE/ACM Transactions on Networking, vol. 29, no. 4, pp. 1436–1451, Aug. 2021, doi: 10.1109/TNET.2020.3027814.
  7. S. Naganandhini and D. Shanthi, “Optimizing Replication of Data for Distributed Cloud Computing Environments: Techniques, Challenges, and Research Gap,” in Proceedings of the 2nd International Conference on Edge Computing and Applications, ICECAA 2023, Institute of Electrical and Electronics Engineers Inc., 2023, pp. 35–41. doi: 10.1109/ICECAA58104.2023.10212287.
  8. P. Mondal and E. Tilevich, “Undoing CRDT Operations Automatically,” in Proceedings of the International Conference on Cloud Computing Technology and Science, CloudCom, IEEE Computer Society, 2023, pp. 246–251. doi: 10.1109/CloudCom59040.2023.00047.
  9. D. Wladdimiro, L. Arantes, P. Sens, and N. Hidalgo, “A predictive approach for dynamic replication of operators in distributed stream processing systems,” in Proceedings - Symposium on Computer Architecture and High Performance Computing, IEEE Computer Society, 2022, pp. 120–129. doi: 10.1109/SBAC-PAD55451.2022.00023.
  10. S. Maiyya, I. Ahmad, D. Agrawal, and A. El Abbadi, “Samya: A geo-distributed data system for high contention aggregate data,” in Proceedings - International Conference on Data Engineering, IEEE Computer Society, Apr. 2021, pp. 1440–1451. doi: 10.1109/ICDE51399.2021.00128.
  11. W. Cao et al., “PolarDB-X: An Elastic Distributed Relational Database for Cloud-Native Applications,” in Proceedings - International Conference on Data Engineering, IEEE Computer Society, 2022, pp. 2859–2872. doi: 10.1109/ICDE53745.2022.00259.
  12. F. Hackett, J. Rowe, and M. A. Kuppe, “Understanding Inconsistency in Azure Cosmos DB with TLA+,” in Proceedings - International Conference on Software Engineering, IEEE Computer Society, 2023, pp. 1–12. doi: 10.1109/ICSE-SEIP58684.2023.00006.
  13. S. Liu, H. Xu, L. Liu, W. Bai, K. Chen, and Z. Cai, “RepNet: Cutting Latency with Flow Replication in Data Center Networks,” IEEE Trans Serv Comput, vol. 14, no. 1, pp. 248–261, Jan. 2021, doi: 10.1109/TSC.2018.2793250.
  14. R. Adila, A. B. Nusantara, and U. L. Yuhana, “Optimization Techniques for Data Consistency and Throughput Using Kafka Stateful Stream Processing,” in 6th International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2023 - Proceeding, Institute of Electrical and Electronics Engineers Inc., 2023, pp. 480–485. doi: 10.1109/ISRITI60336.2023.10467675.
  15. S. Han, S. Jang, H. Lee, and S. Pack, “Switch-Centric Byzantine Fault Tolerance Mechanism in Distributed Software Defined Networks,” IEEE Communications Letters, vol. 24, no. 10, pp. 2236–2239, Oct. 2020, doi: 10.1109/LCOMM.2020.3004038.
  16. Z. Mahfoud and N. Nouali-Taboudjemat, “Enhancing Data Consistency via a Context-Aware Dynamic Adaptive Model,” International Journal of Computing and Digital Systems, vol. 15, no. 1, pp. 543–554, Aug. 2024, doi: 10.12785/ijcds/160141.
  17. P. Mishra, S. Kar, V. Bollapragada, and K. C. Wang, “Multi-Connectivity using NR-DC for High Throughput and Ultra-reliable Low Latency Communication in 5G Networks,” in Proceedings - 2021 IEEE 4th 5G World Forum, 5GWF 2021, Institute of Electrical and Electronics Engineers Inc., 2021, pp. 36–40. doi: 10.1109/5GWF52925.2021.00014.
  18. Z. Hu and D. Li, “Improved Heuristic Job Scheduling Method to Enhance Throughput for Big Data Analytics.” [Online]. Available: http://creativecommons.org/licenses/by/4.0/
  19. A. Behrouzi-Far and E. Soljanin, “Efficient Replication for Fast and Predictable Performance in Distributed Computing,” IEEE/ACM Transactions on Networking, vol. 29, no. 4, pp. 1467–1476, Aug. 2021, doi: 10.1109/TNET.2021.3062215.
  20. G. Yao, Q. Ren, X. Li, S. Zhao, and R. Ruiz, “A Hybrid Fault-Tolerant Scheduling for Deadline-Constrained Tasks in Cloud Systems,” IEEE Trans Serv Comput, vol. 15, no. 3, pp. 1371–1384, 2022, doi: 10.1109/TSC.2020.2992928.
  21. J. M. D. S. Sant’Ana, A. Hoeller, R. D. Souza, S. Montejo-Sanchez, H. Alves, and M. De Noronha-Neto, “Hybrid Coded Replication in LoRa Networks,” IEEE Trans Industr Inform, vol. 16, no. 8, pp. 5577–5585, Aug. 2020, doi: 10.1109/TII.2020.2966120.
  22. W. Yu, T. J. Chua, and J. Zhao, “Asynchronous Hybrid Reinforcement Learning for Latency and Reliability Optimization in the Metaverse Over Wireless Communications,” IEEE Journal on Selected Areas in Communications, vol. 41, no. 7, pp. 2138–2157, Jul. 2023, doi: 10.1109/JSAC.2023.3280988.
  23. J. M. D. S. Sant’ana, S. Montejo-Sanchez, R. D. Souza, and H. Alves, “Nonorthogonal Replication Scheme for ALOHA Uplink in LPWAN,” IEEE Trans Industr Inform, vol. 20, no. 2, pp. 1575–1584, Feb. 2024, doi: 10.1109/TII.2023.3280315.
Index Terms

Computer Science
Information Sciences

Keywords

Replication Distributed Databases Fault Tolerance Performance Optimization