TY - GEN
T1 - X-CSR
T2 - 25th IEEE International Conference on Data Engineering, ICDE 2009
AU - Zinn, Daniel
AU - Bowersy, Shawn
AU - McPhillipsy, Timothy
AU - Ludascher, Bertram
PY - 2009
Y1 - 2009
N2 - XML process networks are a simple, yet powerful programming paradigm for loosely coupled, coarse-grained dataflow applications such as data-centric scientific workflows. We describe a framework called δ-XML that is well-suited for applications in which pipelines of data processors modify parts ("deltas") of XML data collections while keeping the overall collection structure intact.We show how to optimize the execution of δ-XML process networks by minimizing the data shipping cost in distributed settings. This X-CSR 1 optimization employs static type inference based on XML Schema to determine the XML stream fragments that are relevant to a processor, allowing irrelevant fragments to be bypassed ("shipped") to downstream pipeline steps. Finally, we present evaluation results for a realworld scientific workflow, which shows the practical feasibility of X-CSR. A long version of this paper is available as [1].
AB - XML process networks are a simple, yet powerful programming paradigm for loosely coupled, coarse-grained dataflow applications such as data-centric scientific workflows. We describe a framework called δ-XML that is well-suited for applications in which pipelines of data processors modify parts ("deltas") of XML data collections while keeping the overall collection structure intact.We show how to optimize the execution of δ-XML process networks by minimizing the data shipping cost in distributed settings. This X-CSR 1 optimization employs static type inference based on XML Schema to determine the XML stream fragments that are relevant to a processor, allowing irrelevant fragments to be bypassed ("shipped") to downstream pipeline steps. Finally, we present evaluation results for a realworld scientific workflow, which shows the practical feasibility of X-CSR. A long version of this paper is available as [1].
UR - http://www.scopus.com/inward/record.url?scp=67649668619&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=67649668619&partnerID=8YFLogxK
U2 - 10.1109/ICDE.2009.72
DO - 10.1109/ICDE.2009.72
M3 - Conference contribution
AN - SCOPUS:67649668619
SN - 9780769535456
T3 - Proceedings - International Conference on Data Engineering
SP - 577
EP - 580
BT - Proceedings - 25th IEEE International Conference on Data Engineering, ICDE 2009
Y2 - 29 March 2009 through 2 April 2009
ER -