Failure detection vs group membership in fault-tolerant distributed systems: Hidden trade-offs

6Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Failure detection and group membership are two important components of fault-tolerant distributed systems. Understanding their role is essential when developing efficient solutions, not only in failure-free runs, but also in runs in which processes do crash. While group membership provides consistent information about the status of processes in the system, failure detectors provide inconsistent information. This paper discusses the trade-offs related to the use of these two components, and clarifies their roles using three examples. The first example shows a case where group membership may favourably be replaced by a failure detection mechanism. The second example illustrates a case where group membership is mandatory. Finally, the third example shows a case where neither group membership nor failure detectors are needed (they may be replaced by weak ordering oracles).

Cite

CITATION STYLE

APA

Schiper, A. (2002). Failure detection vs group membership in fault-tolerant distributed systems: Hidden trade-offs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2399, pp. 1–15). Springer Verlag. https://doi.org/10.1007/3-540-45605-8_1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free