CFP last date
22 April 2024
Reseach Article

Protocol for Coordinated Checkpointing using Smart Interval with Dual Coordinator

by Manoj Kumar Niranjan, Mahesh Motwani
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 99 - Number 11
Year of Publication: 2014
Authors: Manoj Kumar Niranjan, Mahesh Motwani
10.5120/17416-8199

Manoj Kumar Niranjan, Mahesh Motwani . Protocol for Coordinated Checkpointing using Smart Interval with Dual Coordinator. International Journal of Computer Applications. 99, 11 ( August 2014), 15-19. DOI=10.5120/17416-8199

@article{ 10.5120/17416-8199,
author = { Manoj Kumar Niranjan, Mahesh Motwani },
title = { Protocol for Coordinated Checkpointing using Smart Interval with Dual Coordinator },
journal = { International Journal of Computer Applications },
issue_date = { August 2014 },
volume = { 99 },
number = { 11 },
month = { August },
year = { 2014 },
issn = { 0975-8887 },
pages = { 15-19 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume99/number11/17416-8199/ },
doi = { 10.5120/17416-8199 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:27:55.896865+05:30
%A Manoj Kumar Niranjan
%A Mahesh Motwani
%T Protocol for Coordinated Checkpointing using Smart Interval with Dual Coordinator
%J International Journal of Computer Applications
%@ 0975-8887
%V 99
%N 11
%P 15-19
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Checkpointing is a very popular technique for fault tolerance in distributed systems. The proposed protocol tolerates the transient faults. In the protocol, all processes take checkpoints to form a global consistent checkpoint. The protocol handles the failures of initiator and non-initiator.

References
  1. Introduction to Distributed System Design, Google Code University, http://code. google. com/edu/parallel/dsd-tutorial. html#Basics
  2. D. Manivannan, R. H. B. Netzer & M. Singhal, "Finding Consistent Global Checkpoints in a Distributed Computation", IEEE Trans. On Parallel & Distributed Systems, Vol. 8, No. 6, pp. 623-627 (June 1997)
  3. J. Tsai & S. Kuo, "Theoretical Analysis for Communication-Induced Checkpointing Protocols with Rollback-Dependency Trackability"; IEEE Trans. On Parallel & Distributed Systems, Vol. 9, No. 10, pp. 963-971 (October 1998)
  4. B. Bhargava and S. R. Lian, "Independent Checkpointing and Concurrent Rollback for Recovery in Distributed Systems-An Optimistic Approach", Proceeding of IEEE Symposium on Reliable Distributed Systems, pp. 3-12 (1988)
  5. Guohong Cao, and Mukesh Singhal, "On Coordinated Checkpointing in Distributed Systems," IEEE Transactions On Parallel And Distributed Systems," Vol. 9, No. 12, pp. 1213-122 (Dec. 1998)
  6. Sharma D. D. and Pradhan D. K. , "An Efficient Coordinated Checkpointing Scheme for Multicomputers," Proc. IEEE Workshop on Fault-Tolerant Parallel and Distributed Systems, pp 36-42 (June 1994)
  7. E. N. Elnozahy, D. B. Johnson, and W. Zwaenepoel, "The Performance of Consistent Checkpointing," Proc. 11th Symp. Reliable Distributed Systems, pp. 39–47 (Oct. 1992)
  8. E. N. (Mootaz) Elnozahy, Lorenzo Alvisi, Yi-Min Wang and David B. Johnson, "A Survey of Rollback-Recovery Protocols in Message-Passing Systems", ACM Computing Surveys (CSUR), Volume 34, Issue 3 (September 2002) Page(s):375-408 (2002)
  9. Ch. D. V. Subba Rao and M. M. Naidu, "A New, Efficient Coordinated Checkpointing Protocol Combined with Selective Sender-Based Message Logging", IEEE/ACS International Conference on Computer Systems and Applications, AICCSA 2008, pp. 444-447 (2008)
  10. Sarmistha Neogy, Anupam Sinha, Pradip K Das, "CCUML: A Checkpointing Protocol for Distributed System Processes", IEEE Transactions on TENCON 2004, IEEE Region 10 Conference, Volume B, 21-24 Nov. 2004, Page(s):553 – 556 (2004)
  11. J. Makhijani, M. K. Niranjan, M. Motwani, A. K. Sachan, A. Rajput, "An efficient protocol using smart interval for coordinated checkpointing", International Conference on Advances in Information Technology and Mobile Communication – AIM 2011
  12. K. M. Chandy & L. Lamport, "Distributed Snapshots: Determining Global States of Distributed Systems", ACM Trans. On Computer Systems, Vol. 3, no. , Feb 1985, pp 63-75 (1985)
Index Terms

Computer Science
Information Sciences

Keywords

Distributed Systems Checkpointing Fault Tolerance Smart Interval.