An Efficient and Accurate Compression Ratio Estimation Model for SZx

Arham Khan, Sheng Di, Kai Zhao, Jinyang Liu, Kyle Chard, Ian Fosterv, Franck Cappello

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Modern large-scale HPC applications generate enormous volumes of data for subsequent storage or transfer. Error-controlled lossy compression effectively reduces data sizes and preserves data fidelity based on user-defined error bounds, but compression ratios are unknown until after compression. Many use cases, however, require knowledge of compression ratios a priori to pre-allocate memory for the compressed data at runtime and avoid simulation crashes caused by lack of storage space. We propose Surrogate-based Error-controlled Lossy Compression Ratio Estimation Framework (SECRE), which estimates the true compression ratio via data sampling and a lightweight compression surrogate. Results for SZx on 4 real-world scientific datasets show an extremely low estimation error (e.g., ~1% estimation errors for SZx) and low execution overhead (e.g., ~2% estimation cost for SZx).

Original languageEnglish (US)
Title of host publicationProceedings - 2023 IEEE International Conference on Cluster Computing Workshops and Posters, CLUSTER Workshops 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages48-49
Number of pages2
ISBN (Electronic)9798350370621
DOIs
StatePublished - 2023
Externally publishedYes
Event25th IEEE International Conference on Cluster Computing Workshops, CLUSTER Workshops 2023 - Santa Fe, United States
Duration: Oct 31 2023Nov 3 2023

Publication series

NameProceedings - IEEE International Conference on Cluster Computing, ICCC
ISSN (Print)1552-5244

Conference

Conference25th IEEE International Conference on Cluster Computing Workshops, CLUSTER Workshops 2023
Country/TerritoryUnited States
CitySanta Fe
Period10/31/2311/3/23

Keywords

  • compression ratio estimation
  • error-controlled lossy compression
  • sampling
  • scientific datasets

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Signal Processing

Fingerprint

Dive into the research topics of 'An Efficient and Accurate Compression Ratio Estimation Model for SZx'. Together they form a unique fingerprint.

Cite this