CFP last date
20 January 2025
Reseach Article

A Primary Shift Protocol for Improving Availability in Replication Systems

by Almetwally M. Mostafa, Ahmed E. Youssef
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 72 - Number 4
Year of Publication: 2013
Authors: Almetwally M. Mostafa, Ahmed E. Youssef
10.5120/12485-8905

Almetwally M. Mostafa, Ahmed E. Youssef . A Primary Shift Protocol for Improving Availability in Replication Systems. International Journal of Computer Applications. 72, 4 ( June 2013), 37-44. DOI=10.5120/12485-8905

@article{ 10.5120/12485-8905,
author = { Almetwally M. Mostafa, Ahmed E. Youssef },
title = { A Primary Shift Protocol for Improving Availability in Replication Systems },
journal = { International Journal of Computer Applications },
issue_date = { June 2013 },
volume = { 72 },
number = { 4 },
month = { June },
year = { 2013 },
issn = { 0975-8887 },
pages = { 37-44 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume72/number4/12485-8905/ },
doi = { 10.5120/12485-8905 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:38:46.875447+05:30
%A Almetwally M. Mostafa
%A Ahmed E. Youssef
%T A Primary Shift Protocol for Improving Availability in Replication Systems
%J International Journal of Computer Applications
%@ 0975-8887
%V 72
%N 4
%P 37-44
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Primary Backup Replication (PBR) is the most common technique to achieve availability in distributed systems. However, primary failure remains a crucial problem that threatens availability. When the primary fails, backup nodes in the system have to elect a new primary node in order to maintain adequate system's operation. During election, the system suffers from transaction loss, communication overhead due to messages exchange necessary to preserve data consistency, and a notable delay caused by the execution of Leader Election Algorithms (LEA). Primary failures can be unpredictable (i. e. , unplanned), such as primary node crashes and network outages, or predictable (i. e. , planned), such as primary's scheduled shutdown to perform routine maintenance or software upgrade. Traditionally, PBR employ LEA to recover from both unplanned and planned outages. In this paper, we propose a novel protocol, called Primary Shift Replication (PSR), to avoid election during planned outages. PSR shifts the primary role from the current primary to another scheduled node (without election) when a planned outage is about to occur. Number of messages and communication time required to shift the primary node to another node is much less than number of messages and time required to perform leader election; therefore, PSR improves system's availability. Moreover, PSR guarantees no transactions loss during the shift mode, hence, it preserves data consistency.

References
  1. N. Budhiraja, K. Marzullo, F. B. Schneider, and S. Toueg. 1993. The Primary-backup Approach. S. J. Mullender, editor, Distributed Systems, Addison-Wesley, (Chapter 8).
  2. Navin Budhiraja, Keith Marzullo. 1995. Tradeoffs in Implementing Primary-Backup Protocols. SPDP '95 Proceedings of the 7th IEEE Symposium on Parallel and Distributed Processing.
  3. Greg N. Frederickson and Nancy A. Lynch. 1987. Electing a Leader in a Synchronous Ring. J. ACM, 34(1):98–115.
  4. H. Garcia-Molina. Elections in a Distributed Computing System. 1982. IEEE Trans. Computers, 31(1):48–59.
  5. Suresh Singh and James F. Kurose. 1994. Electing Good Leaders. Journal of Parallel and Distributed Computing, Volume 21, Issue 2, pages 184–201.
  6. Scott D. Stoller. 2000. Leader election in asynchronous distributed systems. IEEE Transactions on Computers, 49(3):283–284.
  7. AL-Metwally Mostafa, Ilies Alouini. Fault Tolerant Global Store Module. (2004). http://www. mozart-oz. org/mogul/doc/metwally/globalstore/
  8. Valentin Mesaros , Raphaël Collet , Kevin Glynn , Peter Van Roy. 2005. A Transactional System for Structured Overlay Networks. Research Report RR2005-01, Universit´e catholique de Louvain, D´epartement INGI.
  9. E. Cecchet, G. Candea, and A. Ailamaki. 2008. Middleware-based Database Replication: The Gaps Between Theory and Practice. ACM SIGMOD Conference, Vancouver, Canada.
  10. Rachid Guerraoui and Luís Rodrigues. 2006. Introduction to Reliable Distributed Programming. Springer-Verlag Berlin Heidelberg.
  11. Y. Saito and M. Shapiro. 2002. Replication: Optimistic Approaches. Technical Report, Microsoft Research Ltd.
  12. W. Lang, J. Patel, and J. Naughton. 2009. On Energy Management Load Balancing and Replication. SIGMOD Record, Vol. 38, No, 4, 2009
  13. M. EffatParvar, N. Yazdani, Mehdi. EffatParvar, A. Dadlani, and A. Khonsari. 2010. Improved Algorithm for Leadership Election in Distributed Systems. 2nd International Conference on Computer Engineering and Technology.
  14. D. P. Gawali. 2012. Leader Election Problem in Distributed Algorithm. International Journal of Computer Science and Technology.
  15. Mina Shirali, Abolfazl HaghighatToroghi, and Mehdi Vojdani. 2008. Leader Election Algorithms: History and Novel schemes. Third 2008 International Conference on Convergence and Hybrid Information Technology.
  16. Robbert van Renesse and Fred B. Schneider. 2004. Chain Replication for Supporting High Throughput and Availability. In OSDI'04: Proceedings of the 6th conference on Symposium on Operating Systems Design & Implementation, Berkeley, CA, USA.
  17. Heutelbeck, D. and Hemmje, M. 2006. Distributed Leader Election in P2P Systems for Dynamic Sets. MDM, Page(s): 29 – 29.
  18. Zargarnataj, M. 2007. New Election Algorithm Based on Assistant in Distributed Systems", AICCSA 07, Page(s):324 – 331.
  19. Summary of the Amazon EC2 and Amazon RDS Service Disruption in the US East Region. 2001. http://aws. amazon. com/message/65648/
Index Terms

Computer Science
Information Sciences

Keywords

Primary Backup Replication Leader Election Transaction Store Replication Systems