SECRE: Surrogate-Based Error-Controlled Lossy Compression Ratio Estimation Framework

Arham Khan, Sheng Di, Kai Zhao, Jinyang Liu, Kyle Chard, Ian Foster, Franck Cappello

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Error-controlled lossy compression has been effective in reducing data storage/transfer costs while preserving reconstructed data fidelity based on user-defined error bounds. State-of-the-art error-controlled lossy compressors primarily fo-cus on error control rather than compression size, and thus, compression ratios are unknown until the compression operation is fully completed. Many use cases, however, require knowledge of compression ratios a priori, for example, pre-allocating appropri-ate memory for the compressed data at runtime. In this paper, we propose a novel, efficient Surrogate-based Error-controlled Lossy Compression Ratio Estimation Framework (SECRE), which includes three key features/contributions. (1) We carefully design the SECRE framework, which, in principle, can be applied to different error-bounded lossy compressors. (2) We implement a compression ratio estimation method for four state-of-the-art error-controlled lossy compressors-SZx, SZ3, ZFP, and SPERR-by devising a corresponding lightweight compression surrogate for each. (3) We evaluate the performance and accuracy of SECRE using four real-world scientific simulation datasets. Experiments show that SECREcan obtain highly accurate com-pression ratio estimates (e.g., 1 % estimation errors for SZx) with low execution overhead (e.g., 2 % estimation cost for SZx).

Original languageEnglish (US)
Title of host publicationProceedings - 2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics, HiPC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages132-142
Number of pages11
ISBN (Electronic)9798350383225
DOIs
StatePublished - 2023
Event30th Annual IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2023 - Goa, India
Duration: Dec 18 2023Dec 21 2023

Publication series

NameProceedings - 2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics, HiPC 2023

Conference

Conference30th Annual IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2023
Country/TerritoryIndia
CityGoa
Period12/18/2312/21/23

Keywords

  • compression ratio estimation
  • error-controlled lossy compression
  • sampling
  • scientific datasets

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Science Applications
  • Hardware and Architecture
  • Information Systems
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'SECRE: Surrogate-Based Error-Controlled Lossy Compression Ratio Estimation Framework'. Together they form a unique fingerprint.

Cite this