Understanding Impact of Lossy Compression on Derivative-related Metrics in Scientific Datasets

Zhaoyuan Su, Sheng Di, Ali Murat Gok, Yue Cheng, Franck Cappello

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Today's scientific simulations are producing ex-tremely large amount of data everyday, which induces grand challenges in transferring and storing the data efficiently. Error-bounded lossy compression has been thought of as the most promising solution to the bigdata issue, however, it would cause data distortion that has to be controlled carefully for user's post-hoc analysis. Recently, the preservation of quantities of interest has become a priority. Derivative-related metrics are critical quantities of interest for many applications across domains. How-ever, no prior research explored the impact of lossy compression on derivative-related metrics in particular. In this paper, we focus on understanding the impact of various error-controlled lossy compressors on multiple derivative-related metrics commonly concerned by users. We perform solid experiments that involve 5 state-of-the-art lossy compressors and 4 real-world application datasets. We summarize 5 valuable takeaways, which can shed some light in understanding the impact of lossy compression on derivative- related metrics.

Original languageEnglish (US)
Title of host publicationProceedings of DRBSD-8 2022
Subtitle of host publication8th International Workshop on Data Analysis and Reduction for Big Scientific Data, Held in conjunction with SC 2022: The International Conference for High Performance Computing, Networking, Storage and Analysis
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages44-53
Number of pages10
ISBN (Electronic)9781665463379
DOIs
StatePublished - 2022
Event8th IEEE/ACM International Workshop on Data Analysis and Reduction for Big Scientific Data, DRBSD-8 2022 - Dallas, United States
Duration: Nov 13 2022Nov 18 2022

Publication series

NameProceedings of DRBSD-8 2022: 8th International Workshop on Data Analysis and Reduction for Big Scientific Data, Held in conjunction with SC 2022: The International Conference for High Performance Computing, Networking, Storage and Analysis

Conference

Conference8th IEEE/ACM International Workshop on Data Analysis and Reduction for Big Scientific Data, DRBSD-8 2022
Country/TerritoryUnited States
CityDallas
Period11/13/2211/18/22

Keywords

  • Data Reduction
  • Derivative
  • HPC
  • Lossy Compression

ASJC Scopus subject areas

  • Computer Science Applications
  • Information Systems
  • Earth and Planetary Sciences (miscellaneous)

Fingerprint

Dive into the research topics of 'Understanding Impact of Lossy Compression on Derivative-related Metrics in Scientific Datasets'. Together they form a unique fingerprint.

Cite this