Modeling and querying scientific workflow provenance in the D-OPM

Victor Cuevas-Vicenttin, Saumen Dey, Michael Li Yuan Wang, Tianhong Song, Bertram Ludascher

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We present the D-OPM, a model that extends the Open Provenance Model (OPM) with workflow-specific aspects. In particular, our model captures aspects such as the workflow structure, traces, data structure, and workflow evolution. Thus, it enables scientists to obtain detailed information about the origin of data resulting from past experiments, as well as about the process itself and its possible future executions. A reference implementation of the D-OPM validates our model and opens the opportunity for interoperation with multiple workflow systems. Furthermore, to facilitate querying D-OPM data we introduce a querying mechanism based on regular path queries (RPQs) on provenance graphs. Our RPQs evaluator is built on a relational DBMS which makes it robust and extensible.

Original languageEnglish (US)
Title of host publicationProceedings - 2012 SC Companion
Subtitle of host publicationHigh Performance Computing, Networking Storage and Analysis, SCC 2012
Pages119-128
Number of pages10
DOIs
StatePublished - 2012
Externally publishedYes
Event2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012 - Salt Lake City, UT, United States
Duration: Nov 10 2012Nov 16 2012

Publication series

NameProceedings - 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012

Other

Other2012 SC Companion: High Performance Computing, Networking Storage and Analysis, SCC 2012
CountryUnited States
CitySalt Lake City, UT
Period11/10/1211/16/12

ASJC Scopus subject areas

  • Computational Theory and Mathematics
  • Computer Science Applications
  • Software

Fingerprint Dive into the research topics of 'Modeling and querying scientific workflow provenance in the D-OPM'. Together they form a unique fingerprint.

Cite this