Turbine: A distributed-memory dataflow engine for extreme-scale many-task applications

Justin M. Wozniak, Timothy G. Armstrong, Ketan Maheshwari, Ewing L. Lusk, Daniel S. Katz, Michael Wilde, Ian T. Foster

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Efficiently utilizing the rapidly increasing concurrency of multi-petaop computing systems is a significant program- ming challenge. One approach is to structure applications with an upper layer of many loosely-coupled coarse-grained tasks, each comprising a tightly-coupled parallel function or program. "Many-task" programming models such as functional parallel dataflow may be used at the upper layer to generate massive numbers of tasks, each of which generates significant tighly-coupled parallelism at the lower level via multithreading, message passing, and/or partitioned global address spaces. At large scales, however, the management of task distribution, data dependencies, and inter-task data movement is a significant performance challenge. In this work, we describe Turbine, a new highly scalable and distributed many-task dataflow engine. Turbine executes a generalized many-task intermediate representation with automated self-distribution, and is scalable to multi-petaop infrastructures. We present here the architecture of Turbine and its performance on highly concurrent systems.

Original languageEnglish (US)
Title of host publicationProceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies, SWEET 2012
DOIs
StatePublished - 2012
Externally publishedYes
Event1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies, SWEET 2012 - Scottsdale, AZ, United States
Duration: May 20 2012May 20 2012

Publication series

NameACM International Conference Proceeding Series

Other

Other1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies, SWEET 2012
Country/TerritoryUnited States
CityScottsdale, AZ
Period5/20/125/20/12

Keywords

  • ADLB
  • Concurrency
  • Dataflow
  • Exascale
  • MPI
  • Swift
  • Turbine

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Turbine: A distributed-memory dataflow engine for extreme-scale many-task applications'. Together they form a unique fingerprint.

Cite this