TY - JOUR
T1 - A framework for the design and reuse of grid workflows
AU - Altintas, Ilkay
AU - Birnbaum, Adam
AU - Baldridge, Kim K.
AU - Sudholt, Wibke
AU - Miller, Mark
AU - Amoreira, Celine
AU - Potier, Yohann
AU - Ludaescher, Bertram
N1 - Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2005
Y1 - 2005
N2 - Grid workflows can be seen as special scientific workflows involving high performance and/or high throughput computational tasks. Much work in grid workflows has focused on improving application performance through schedulers that optimize the use of computational resources and bandwidth. As high-end computing resources are becoming more of a commodity that is available to new scientific communities, there is an increasing need to also improve the design and reusability "performance" of scientific workflow systems. To this end, we are developing a framework that supports the design and reuse of grid workflows. Individual workflow components (e.g., for data movement, database querying, job scheduling, remote execution etc.) are abstracted into a set of generic, reusable tasks. Instantiations of these common tasks can be functionally equivalent atomic components (called actors) or composite components (so-called composite actors or subworkflows). In this way, a grid workflow designer does not have to commit to a particular Grid technology when developing a scientific workflow; instead different technologies (e.g. GridFTP, SRB, and scp) can be used interchangeably and in concert. We illustrate the application of our framework using two real-world Grid workflows from different scientific domains, i.e., cheminformatics and bioinformatics, respectively.
AB - Grid workflows can be seen as special scientific workflows involving high performance and/or high throughput computational tasks. Much work in grid workflows has focused on improving application performance through schedulers that optimize the use of computational resources and bandwidth. As high-end computing resources are becoming more of a commodity that is available to new scientific communities, there is an increasing need to also improve the design and reusability "performance" of scientific workflow systems. To this end, we are developing a framework that supports the design and reuse of grid workflows. Individual workflow components (e.g., for data movement, database querying, job scheduling, remote execution etc.) are abstracted into a set of generic, reusable tasks. Instantiations of these common tasks can be functionally equivalent atomic components (called actors) or composite components (so-called composite actors or subworkflows). In this way, a grid workflow designer does not have to commit to a particular Grid technology when developing a scientific workflow; instead different technologies (e.g. GridFTP, SRB, and scp) can be used interchangeably and in concert. We illustrate the application of our framework using two real-world Grid workflows from different scientific domains, i.e., cheminformatics and bioinformatics, respectively.
UR - http://www.scopus.com/inward/record.url?scp=24644481094&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=24644481094&partnerID=8YFLogxK
U2 - 10.1007/11423287_11
DO - 10.1007/11423287_11
M3 - Conference article
AN - SCOPUS:24644481094
SN - 0302-9743
VL - 3458
SP - 120
EP - 133
JO - Lecture Notes in Computer Science
JF - Lecture Notes in Computer Science
T2 - First International Workshop on Scientific Applications of Grid Computing, SAG 2004
Y2 - 20 September 2004 through 24 September 2004
ER -