The Case for Workflow-Aware Storage: An Opportunity Study

L. B. Costa, H. Yang, E. Vairavanathan, A. Barros, K. Maheshwari, G. Fedak, D. Katz, M. Wilde, M. Ripeanu, S. Al-Kiswany

Research output: Contribution to journal, Article, peer-review

Abstract

This article evaluates the potential gains a workflow-aware storage system can bring. Two observations lead us to believe that such a storage system is crucial to efficiently supporting workflow-based applications. First, workflows generate irregular and application-dependent data access patterns. These patterns leave existing generic storage systems unable to harness all optimization opportunities, as doing so often requires enabling conflicting optimizations or even conflicting design decisions at the storage system level. Second, most workflow runtime engines make suboptimal scheduling decisions because they lack the detailed data location information that is generally hidden by the storage system. This article presents a limit study that evaluates the potential gains from building a workflow-aware storage system that supports per-file access optimizations and exposes data location. Our evaluation using synthetic benchmarks and real applications shows that a workflow-aware storage system can bring significant performance gains: up to 3x compared to a vanilla distributed storage system deployed on the same resources yet unaware of the possible file-level optimizations.

Original language: English (US)
Pages (from-to): 95-113
Number of pages: 19
Journal: Journal of Grid Computing
Volume: 13
Issue number: 1
State: Published - Mar 2015
Externally published: Yes

Keywords

  • Large-scale storage system
  • Workflow runtime engine
  • Workflow-aware storage system

ASJC Scopus subject areas

  • Software
  • Information Systems
  • Hardware and Architecture
  • Computer Networks and Communications
