skip to main content
10.1145/2034640.2034645acmotherconferencesArticle/Chapter ViewAbstractPublication PagestaddsConference Proceedingsconference-collections
research-article

What model and what conditions to implement unreliable failure detectors in dynamic networks?

Published: 19 September 2011 Publication History

Abstract

Failure detectors are classical mechanisms which provide information about process failures and can help systems to cope with the high dynamics of self-organizing, unstructured and mobile wireless networks. Unreliable failure detectors of class ◊S are of special interest because they meet the weakest assumptions able to solve fundamental problems on the design of dependable systems. Unfortunately, a negative result states that no failure detector of that class can be implemented in a network of an unknown membership; but full membership knowledge as well as fully communication connectivity are no longer appropriate assumptions to the new scenario of dynamic networks. In this paper, we provide a discussion about the conditions and model able to implement failure detectors in dynamic networks and define a new class, namely ◊SM, which adapts the properties of the ◊S class to a dynamic network with an unknown membership.

References

[1]
M. K. Aguilera. A pleasant stroll through the land of infinitely many creatures. SIGACT News, 35(2):36--59, 2004.
[2]
M. K. Aguilera, W. Chen, and S. Toueg. Heartbeat: A timeout-free failure detector for quiescent reliable communication. In Proc. 11th Int. Work. on Distributed Algorithms, pages 126--140, 1997.
[3]
V. Bhandari and N. H. Vaidya. On reliable broadcast in a radio network. In 24th ACM Symp. on Principles of distributed computing, pages 138--147. ACM, 2005.
[4]
V. Bhandari and N. H. Vaidya. Reliable local broadcast in a wireless network prone to byzantine failures. In 4th ACM Int. Work. on Foundations of Mobile Computing, 2007.
[5]
V. Bhandari and N. H. Vaidya. Reliable broadcast in radio networks with locally bounded failures. IEEE Trans. on Parallel and Distributed Systems, 21:801--811, 2010.
[6]
T. Camp, J. Boleng, and V. Davies. A survey of mobility models for ad hoc network research. Wireless Communications & Mobile Computing: Special issue on Mobile Ad Hoc Networking: Research, Trends and Applications, 2:483--502, 2002.
[7]
J. Cao, M. Raynal, C. Travers, and W. Wu. The eventual leadership in dynamic mobile networking environments. In 3th Pacific Rim Int. Symp. on Dependable Computing, pages 123--130, 2007.
[8]
A. Casteigts, S. Chaumette, and A. Ferreira. Characterizing topological assumptions of distributed algorithms in dynamic networks. In Structural Information and Communication Complexity Conf., pages 126--140, 2010.
[9]
A. Casteigts, P. Flocchini, W. Quattrociocchi, and N. Santoro. Time-varying graphs and dynamic networks. Technical report, University of Ottawa, 2011.
[10]
T. Chandra and S. Toueg. Unreliable failure detectors for reliable distributed systems. Journal of the ACM, 43(2):225--267, Mar. 1996.
[11]
B. Devianov and S. Toueg. Failure detector service for dependable computing. In Proc. 1st Int. Conf. on Dependable Syst. and Networks, pages 14--15, 2000.
[12]
R. Friedman and G. Tcharny. Evaluating failure detection in mobile ad-hoc networks. Int. Journal of Wireless and Mobile Computing, 1(8), 2005.
[13]
F. Greve, P. Sens, L. Arantes, and V. Simon. A failure detector for wireless networks with unknown membership. In Euro-Par Conference, LNCS 6853, pages 27--38, 2011.
[14]
I. Gupta, T. D. Chandra, and G. S. Goldszmidt. On scalable and efficient distributed failure detectors. In Proc. of the 20th Symp. on Principles of Distributed Computing, pages 170--179, 2001.
[15]
M. Hutle. An efficient failure detector for sparsely connected networks. In Proc. IASTED Int. Conf. on Parallel and Distributed Computing and Networks, pages 369--374, 2004.
[16]
E. Jiménez, S. Arévalo, and A. Fernández. Implementing unreliable failure detectors with unknown membership. Inf. Process. Lett., 100(2):60--63, 2006.
[17]
C.-Y. Koo. Broadcast in radio networks tolerating byzantine adversarial behavior. In 23th Symp. on Princ. of Distributed Computing, pages 275--282, 2004.
[18]
S. Min-Te, H. Lifei, A. A. Arora, and L. Ten-Hwang. Reliable mac layer multicast in ieee 802.11 wireless networks. In Proc. of the Int. Conference on Parallel Processing, pages 527--536, Aug. 2002.
[19]
A. Mostefaoui, E. Mourgaya, and M. Raynal. Asynchronous implementation of failure detectors. In Proc. of Int. Conf. on Dependable Systems and Networks, 2003.
[20]
A. Mostefaoui, M. Raynal, and C. Travers. Time-free and timer-based assumptions can be combined to obtain eventual leadership. IEEE Trans. Parallel Distrib. Syst., 17(7):656--666, 2006.
[21]
A. Mostefaoui, M. Raynal, C. Travers, S. Patterson, D. Agrawal, and A. Abbadi. From static distributed systems to dynamic systems. In 24th IEEE Symp. on Reliable Distributed Systems, pages 109--118, 2005.
[22]
N. Sridhar. Decentralized local failure detection in dynamic distributed systems. The 25th IEEE Symp. on Reliable Distributed Systems, 0:143--154, 2006.
[23]
A. Tai, K. Tso, and W. Sanders. Cluster-based failure detection service for large-scale ad hoc wireless network applications. In Int. Conf. on Dependable Systems and Networks, pages 805--814, 2004.
[24]
S. Tucci-Piergiovanni and R. Baldoni. Eventual leader election in infinite arrival message-passing system model with bounded concurrency. In Dependable Computing Conference (EDCC), pages 127--134, 2010.
[25]
R. Van Renesse, Y. Minsky, and M. Hayden. A gossip-style failure detection service. Technical report, Ithaca, NY, USA, 1998.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
TADDS '11: Proceedings of the 3rd International Workshop on Theoretical Aspects of Dynamic Distributed Systems
September 2011
33 pages
ISBN:9781450309462
DOI:10.1145/2034640
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 September 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. asynchronous systems
  2. dynamic distributed systems
  3. failure detector
  4. wireless mobile networks

Qualifiers

  • Research-article

Conference

TADDS '11

Acceptance Rates

Overall Acceptance Rate 5 of 8 submissions, 63%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 04 Feb 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media