Abstract provenance graphs: Anticipating and exploiting schema-level data provenance

Daniel Zinn, Bertram Ludäscher

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Provenance graphs capture flow and dependency information recorded during scientific workflow runs, which can subsequently be used to interpret, validate, and debug workflow results. In this paper, we propose the new concept of Abstract Provenance Graphs (APGs). APGs are created via static analysis of a configured workflow W and its input data schema, i.e., before W is actually executed. They summarize all possible provenance graphs that W can create with input data of type τ; that is, for each input ν ∈ τ there exists a graph homomorphism ℋν between the concrete and the abstract provenance graph. APGs are helpful during workflow construction because (1) they make certain workflow design bugs (e.g., selecting no input data or the wrong input data for the actors) easy to spot; and (2) they show how the overall data organization evolves over the course of a workflow. Moreover, after workflows have been run, APGs can be used to validate concrete provenance graphs. A more detailed version of this work is available as [14].
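
The validation idea in the abstract can be illustrated with a minimal sketch. It assumes the simplest reading of "graph homomorphism": every edge of the concrete provenance graph must map onto an edge of the abstract one. The function name `is_homomorphism`, the node names, and the toy graphs are hypothetical and are not taken from the paper, whose actual definition may also constrain node labels or types.

```python
from typing import Dict, Hashable, Iterable, Tuple

Edge = Tuple[Hashable, Hashable]


def is_homomorphism(
    concrete_edges: Iterable[Edge],
    abstract_edges: Iterable[Edge],
    mapping: Dict[Hashable, Hashable],
) -> bool:
    """Return True if `mapping` sends every concrete edge onto an abstract edge.

    Assumption: plain directed graphs without labels; this is an
    illustrative check, not the paper's algorithm.
    """
    abstract = set(abstract_edges)
    for src, dst in concrete_edges:
        # Every concrete node on an edge must have an abstract image.
        if src not in mapping or dst not in mapping:
            return False
        # The image of the edge must exist in the abstract graph.
        if (mapping[src], mapping[dst]) not in abstract:
            return False
    return True


# Hypothetical abstract graph: inputs of type tau feed actor A, which emits outputs.
abstract_edges = {("in_tau", "actor_A"), ("actor_A", "out_sigma")}

# Hypothetical concrete run: two inputs, two invocations of A, two outputs.
concrete_edges = [("x1", "A#1"), ("x2", "A#2"), ("A#1", "y1"), ("A#2", "y2")]
mapping = {
    "x1": "in_tau", "x2": "in_tau",
    "A#1": "actor_A", "A#2": "actor_A",
    "y1": "out_sigma", "y2": "out_sigma",
}

valid = is_homomorphism(concrete_edges, abstract_edges, mapping)

# A dependency that bypasses the actor has no counterpart in the abstract graph.
bad = is_homomorphism(concrete_edges + [("x1", "y1")], abstract_edges, mapping)
```

Under these assumptions, a concrete graph that fails the check contains a dependency the abstract graph did not anticipate, which is the kind of mismatch post-run validation would flag.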

Original language: English (US)
Title of host publication: Provenance and Annotation of Data and Processes - Third International Provenance and Annotation Workshop, IPAW 2010, Revised Selected Papers
Pages: 206-215
Number of pages: 10
State: Published - 2010
Externally published: Yes
Event: 3rd International Provenance and Annotation Workshop, IPAW 2010 - Troy, NY, United States
Duration: Jun 15 2010 – Jun 16 2010

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 6378 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 3rd International Provenance and Annotation Workshop, IPAW 2010
Country: United States
City: Troy, NY
Period: 6/15/10 – 6/16/10

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)
