Abstract
Creating online data repositories that follow Findable, Accessible, Interoperable, and Reusable (FAIR) principles has been a significant focus in the research community to address the reproducibility crisis facing many computational fields, including environmental modeling. However, less work has focused on another reproducibility challenge: capturing modeling software and computational environments needed to reproduce complex modeling workflows. Containerization technology offers an opportunity to address this need, and there are a growing number of strategies being put forth that leverage containerization to improve the reproducibility of environmental modeling. This research compares ten such approaches using a hydrologic model application as a case study. For each approach, we use both quantitative and qualitative metrics for comparing the different strategies. Based on the results, we discuss challenges and opportunities for containerization in environmental modeling and recommend best practices across both research and educational use cases for when and how to apply the different containerization-based strategies.
Original language | English (US) |
---|---|
Article number | 105760 |
Journal | Environmental Modelling and Software |
Volume | 167 |
DOIs | |
State | Published - Sep 2023 |
Keywords
- Cloud computing
- Container technology
- Cyberinfrastructure
- Hydrologic modeling
- Jupyter notebooks
- Reproducibility
ASJC Scopus subject areas
- Software
- Environmental Engineering
- Ecological Modeling