AVMON: Optimal and scalable discovery of consistent availability monitoring overlays for distributed systems

Ramsés Morales, Indranil Gupta

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper addresses the problem of selection and discovery of a consistent availability monitoring overlay for computer hosts in a large-scale distributed application, where hosts may be selfish or colluding. We motivate six significant goals for the problem - consistency, verifiability, and randomness, in selecting the availability monitors of nodes, as well as discoverability, load-balancing, and scalability in finding these monitors. We then present a new system, called AVMON, that is the first to satisfy these six requirements. The core algorithmic contribution of this paper is a protocol for discovering the availability monitoring overlay in a scalable and efficient manner, given any arbitrary monitor selection scheme that is consistent and verifiable. We mathematically analyze the performance of AVMON's discovery protocols, and derive an optimal variant that minimizes memory, bandwidth, computation, and discovery time of monitors. Our experimental evaluations of AVMON use three types of availability traces - synthetic, from PlanetLab, and from a peer-to-peer system (Overnet) - and demonstrate that AVMON works well in a variety of distributed systems.

Original languageEnglish (US)
Title of host publication27th International Conference on Distributed Computing Systems, ICDCS'07
DOIs
StatePublished - 2007
Event27th International Conference on Distributed Computing Systems, ICDCS'07 - Toronto, ON, Canada
Duration: Jun 25 2007Jun 27 2007

Publication series

NameProceedings - International Conference on Distributed Computing Systems

Other

Other27th International Conference on Distributed Computing Systems, ICDCS'07
Country/TerritoryCanada
CityToronto, ON
Period6/25/076/27/07

Keywords

  • Availability
  • Churn
  • Consistency
  • Monitoring
  • Optimality
  • Overlay
  • Scalability

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'AVMON: Optimal and scalable discovery of consistent availability monitoring overlays for distributed systems'. Together they form a unique fingerprint.

Cite this