An Overview of Checkpointing Techniques for Fault Tolerance in Distributed Computing Systems
Jagdish Makhijani; Dr. Anil Rajput
- Checkpointing is an important feature in
distributed computing systems. It provides fault tolerance
without requiring additional effort from the
programmer [1]. Checkpointing has been widely used to
provide fault tolerance in distributed systems, and much
research has been performed to reduce the overhead of
checkpoint coordination. A checkpoint is a snapshot of
the current state of a process. It saves enough information
in non-volatile stable storage that, if the contents of
volatile storage are lost due to process failure, the
process state can be reconstructed from the saved
information [1].
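The save-and-reconstruct idea described in the abstract can be illustrated with a minimal single-process sketch. This is not the authors' method and does not cover coordination between processes; the function names (`save_checkpoint`, `load_checkpoint`, `run`), the `crash_at` parameter, and the use of a local file as "stable storage" are all assumptions made for illustration.

```python
import os
import pickle
import tempfile


def save_checkpoint(state, path):
    """Persist `state` to `path` (hypothetical stand-in for stable storage).

    The snapshot is written to a temporary file first and then renamed,
    so a crash mid-write never leaves a partial checkpoint behind.
    """
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "wb") as f:
            pickle.dump(state, f)
            f.flush()
            os.fsync(f.fileno())  # force the bytes out of volatile buffers
        os.replace(tmp_path, path)  # atomic rename over the old checkpoint
    except BaseException:
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise


def load_checkpoint(path, default):
    """Return the last saved state, or `default` if no checkpoint exists."""
    try:
        with open(path, "rb") as f:
            return pickle.load(f)
    except FileNotFoundError:
        return default


def run(path, total_steps, crash_at=None):
    """Sum 0..total_steps-1, checkpointing after every step.

    `crash_at` (an illustrative parameter) simulates a process failure
    just before the given step is executed.
    """
    state = load_checkpoint(path, {"step": 0, "total": 0})
    while state["step"] < total_steps:
        if crash_at is not None and state["step"] == crash_at:
            raise RuntimeError("simulated process failure")
        state["total"] += state["step"]
        state["step"] += 1
        save_checkpoint(state, path)
    return state["total"]
```

Calling `run` once with `crash_at` set and then again without it resumes from the last saved step instead of restarting from zero. Checkpointing after every step keeps the sketch simple; a real system would likely checkpoint less often to limit overhead.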
- Year: 2012
- Type of Publication: Article
- Keywords: Backup passive module; Transient; Orphan message; Domino effect; Coordinated check pointing; Livelocks problems
- Journal: IJECCE
- Volume: 3
- Number: 1
- Pages: 133-137
- Month: January