TY - GEN
T1 - File System Semantics Requirements of HPC Applications
AU - Wang, Chen
AU - Mohror, Kathryn
AU - Snir, Marc
N1 - Publisher Copyright:
© 2020 ACM.
PY - 2021/6/21
Y1 - 2021/6/21
N2 - Most widely-deployed parallel file systems (PFSs) implement POSIX semantics, which implies sequential consistency for reads and writes. Strict adherence to POSIX semantics is known to impede performance and thus several new PFSs with relaxed consistency semantics and better performance have been introduced. Such PFSs are useful provided that applications can run correctly on a PFS with weaker semantics. While it is widely assumed that HPC applications do not require strict POSIX semantics, to our knowledge there has not been systematic work to support this assumption. In this paper, we address this gap with a categorization of the consistency semantics guarantees of PFSs and develop an algorithm to determine the consistency semantics requirements of a variety of HPC applications. We captured the I/O activity of 17 representative HPC applications and benchmarks as they performed I/O through POSIX or I/O libraries and examined the metadata operations used and their file access patterns. From this analysis, we find that 16 of the 17 applications can utilize PFSs with weaker semantics.
AB - Most widely-deployed parallel file systems (PFSs) implement POSIX semantics, which implies sequential consistency for reads and writes. Strict adherence to POSIX semantics is known to impede performance and thus several new PFSs with relaxed consistency semantics and better performance have been introduced. Such PFSs are useful provided that applications can run correctly on a PFS with weaker semantics. While it is widely assumed that HPC applications do not require strict POSIX semantics, to our knowledge there has not been systematic work to support this assumption. In this paper, we address this gap with a categorization of the consistency semantics guarantees of PFSs and develop an algorithm to determine the consistency semantics requirements of a variety of HPC applications. We captured the I/O activity of 17 representative HPC applications and benchmarks as they performed I/O through POSIX or I/O libraries and examined the metadata operations used and their file access patterns. From this analysis, we find that 16 of the 17 applications can utilize PFSs with weaker semantics.
KW - consistency semantics
KW - parallel file system
KW - scientific applications
UR - http://www.scopus.com/inward/record.url?scp=85109578962&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85109578962&partnerID=8YFLogxK
U2 - 10.1145/3431379.3460637
DO - 10.1145/3431379.3460637
M3 - Conference contribution
AN - SCOPUS:85109578962
T3 - HPDC 2021 - Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing
SP - 19
EP - 30
BT - HPDC 2021 - Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing
PB - Association for Computing Machinery
T2 - 30th International Symposium on High-Performance Parallel and Distributed Computing, HPDC 2021
Y2 - 21 June 2021 through 25 June 2021
ER -