Intelligent monitoring for adaptation in grid applications

Daniel A. Reed, Celso L. Mendes

Research output: Contribution to journalArticlepeer-review

Abstract

Grid applications access distributed, and often shared, resources. One consequence of this resource sharing is that measured application performance can vary widely and in unexpected ways. Determining the causes of poor performance, due to either anomalous application behavior or contention for shared resource use, and adapting to changing circumstances are critical to creation of robust Grid applications. Performance contracts and real-time adaptive control are two mechanisms to realize soft performance guarantees in Grid environments. Performance contracts formalize the relationship between application performance needs and resource capabilities. During execution, contract monitors use performance data to verify that expectations are met. When the contracted specifications are not satisfied, the system can choose to either adapt the application to available resources or reschedule the application on a new set of resources that can satisfy the original contract specifications. In this paper, we describe an infrastructure for Grid application contract development and monitoring. This infrastructure, based on the Autopilot toolkit, provides flexible and scalable tools to assess both application and system behavior.

Original languageEnglish (US)
Pages (from-to)426-434
Number of pages9
JournalProceedings of the IEEE
Volume93
Issue number2
DOIs
StatePublished - Feb 2005

Keywords

  • Adaptive control
  • Distributed computing
  • Monitoring
  • Parallel processing

ASJC Scopus subject areas

  • General Computer Science
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Intelligent monitoring for adaptation in grid applications'. Together they form a unique fingerprint.

Cite this