Failure detection vs group membership in fault-tolerant distributed systems: Hidden trade-offs

André Schiper

Conference Proceedings

Failure detection vs group membership in fault-tolerant distributed systems: Hidden trade-offs

Schiper A

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2002) 2399 1-15

DOI: 10.1007/3-540-45605-8_1

6Citations

8Readers

Get full text

Abstract

Failure detection and group membership are two important components of fault-tolerant distributed systems. Understanding their role is essential when developing efficient solutions, not only in failure-free runs, but also in runs in which processes do crash. While group membership provides consistent information about the status of processes in the system, failure detectors provide inconsistent information. This paper discusses the trade-offs related to the use of these two components, and clarifies their roles using three examples. The first example shows a case where group membership may favourably be replaced by a failure detection mechanism. The second example illustrates a case where group membership is mandatory. Finally, the third example shows a case where neither group membership nor failure detectors are needed (they may be replaced by weak ordering oracles).

Cite

CITATION STYLE

APA

Schiper, A. (2002). Failure detection vs group membership in fault-tolerant distributed systems: Hidden trade-offs. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2399, pp. 1–15). Springer Verlag. https://doi.org/10.1007/3-540-45605-8_1

Failure detection vs group membership in fault-tolerant distributed systems: Hidden trade-offs

Abstract

Cite

Register to see more suggestions