Toward open and reproducible environmental modeling by integrating online data repositories, computational environments, and model Application Programming Interfaces

Young Don Choi, Jonathan L. Goodall, Jeffrey M. Sadler, Anthony M. Castronova, Andrew Bennett, Zhiyu Li, Bart Nijssen, Shaowen Wang, Martyn P. Clark, Daniel P. Ames, Jeffery S. Horsburgh, Hong Yi, Christina Bandaragoda, Martin Seul, Richard Hooper, David G. Tarboton

Research output: Contribution to journalArticlepeer-review

Abstract

Cyberinfrastructure needs to be advanced to enable open and reproducible environmental modeling research. Recent efforts toward this goal have focused on advancing online repositories for data and model sharing, online computational environments along with containerization technology and notebooks for capturing reproducible computational studies, and Application Programming Interfaces (APIs) for simulation models to foster intuitive programmatic control. The objective of this research is to show how these efforts can be integrated to support reproducible environmental modeling. We present first the high-level concept and general approach for integrating these three components. We then present one possible implementation that integrates HydroShare (an online repository), CUAHSI JupyterHub and CyberGIS-Jupyter for Water (computational environments), and pySUMMA (a model API) to support open and reproducible hydrologic modeling. We apply the example implementation for a hydrologic modeling use case to demonstrate how the approach can advance reproducible environmental modeling through the seamless integration of cyberinfrastructure services.

Original languageEnglish (US)
Article number104888
JournalEnvironmental Modelling and Software
Volume135
DOIs
StatePublished - Jan 2021

Keywords

  • Containers
  • JupyterHub
  • Modeling frameworks
  • Open hydrology
  • Reproducibility

ASJC Scopus subject areas

  • Software
  • Environmental Engineering
  • Ecological Modeling

Fingerprint

Dive into the research topics of 'Toward open and reproducible environmental modeling by integrating online data repositories, computational environments, and model Application Programming Interfaces'. Together they form a unique fingerprint.

Cite this