Deployment and testing of the sustained petascale Blue Waters system

Research output: Contribution to journalArticle

Abstract

Deployment of a large parallel system typically involves several steps of preparation, delivery, installation, testing and acceptance, making such deployments a very complex process. Despite the availability of various petascale systems currently, the steps and lessons from their deployment are rarely described in the literature. This article documents our experiences from the deployment of the sustained petascale Blue Waters system at NCSA. Our presentation is focused on the final deployment steps, where the system was intensively tested and accepted by NCSA. Those experiences and lessons should be useful to guide similarly complex deployments of large systems in the future.

Original languageEnglish (US)
Pages (from-to)327-337
Number of pages11
JournalJournal of Computational Science
Volume10
DOIs
StatePublished - Sep 1 2015

Fingerprint

Availability
Water
Testing
Parallel Systems
Preparation
Experience

Keywords

  • Acceptance testing
  • Large system deployment
  • Petascale performance

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)
  • Modeling and Simulation

Cite this

@article{60c2bb71685e4f2499fcb3114c6fabf2,
title = "Deployment and testing of the sustained petascale Blue Waters system",
abstract = "Deployment of a large parallel system typically involves several steps of preparation, delivery, installation, testing and acceptance, making such deployments a very complex process. Despite the availability of various petascale systems currently, the steps and lessons from their deployment are rarely described in the literature. This article documents our experiences from the deployment of the sustained petascale Blue Waters system at NCSA. Our presentation is focused on the final deployment steps, where the system was intensively tested and accepted by NCSA. Those experiences and lessons should be useful to guide similarly complex deployments of large systems in the future.",
keywords = "Acceptance testing, Large system deployment, Petascale performance",
author = "Mendes, {Celso L.} and Brett Bode and Bauer, {Gregory H.} and Jeremy Enos and Cristina Beldica and Kramer, {William T.}",
year = "2015",
month = "9",
day = "1",
doi = "10.1016/j.jocs.2015.03.007",
language = "English (US)",
volume = "10",
pages = "327--337",
journal = "Journal of Computational Science",
issn = "1877-7503",
publisher = "Elsevier",

}

TY - JOUR

T1 - Deployment and testing of the sustained petascale Blue Waters system

AU - Mendes, Celso L.

AU - Bode, Brett

AU - Bauer, Gregory H.

AU - Enos, Jeremy

AU - Beldica, Cristina

AU - Kramer, William T.

PY - 2015/9/1

Y1 - 2015/9/1

N2 - Deployment of a large parallel system typically involves several steps of preparation, delivery, installation, testing and acceptance, making such deployments a very complex process. Despite the availability of various petascale systems currently, the steps and lessons from their deployment are rarely described in the literature. This article documents our experiences from the deployment of the sustained petascale Blue Waters system at NCSA. Our presentation is focused on the final deployment steps, where the system was intensively tested and accepted by NCSA. Those experiences and lessons should be useful to guide similarly complex deployments of large systems in the future.

AB - Deployment of a large parallel system typically involves several steps of preparation, delivery, installation, testing and acceptance, making such deployments a very complex process. Despite the availability of various petascale systems currently, the steps and lessons from their deployment are rarely described in the literature. This article documents our experiences from the deployment of the sustained petascale Blue Waters system at NCSA. Our presentation is focused on the final deployment steps, where the system was intensively tested and accepted by NCSA. Those experiences and lessons should be useful to guide similarly complex deployments of large systems in the future.

KW - Acceptance testing

KW - Large system deployment

KW - Petascale performance

UR - http://www.scopus.com/inward/record.url?scp=84941277683&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84941277683&partnerID=8YFLogxK

U2 - 10.1016/j.jocs.2015.03.007

DO - 10.1016/j.jocs.2015.03.007

M3 - Article

AN - SCOPUS:84941277683

VL - 10

SP - 327

EP - 337

JO - Journal of Computational Science

JF - Journal of Computational Science

SN - 1877-7503

ER -