TY - GEN
T1 - MemXCT
T2 - 2019 International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2019
AU - Hidayetolu, Mert
AU - Biçer, Tekin
AU - De Gonzalo, Simon Garcia
AU - Ren, Bin
AU - Gürsoy, Doa
AU - Kettimuthu, Rajkumar
AU - Foster, Ian T.
AU - Hwu, Wen Mei W.
N1 - Publisher Copyright:
© 2019 ACM.
PY - 2019/11/17
Y1 - 2019/11/17
N2 - X-ray computed tomography (XCT)is used regularly at synchrotron light sources to study the internal morphology of materials at high resolution. However, experimental constraints, such as radiation sensitivity, can result in noisy or undersampled measurements. Further, depending on the resolution, sample size and data acquisition rates, the resulting noisy dataset can be terabyte-scale. Advanced iterative reconstruction techniques can produce high-quality images from noisy measurements, but their computational requirements have made their use exception rather than the rule. We propose here a novel memory-centric approach that avoids redundant computations at the expense of additional memory complexity. We develop a system, MemXCT, that uses an optimized SpMV implementation with two-level pseudo-Hilbert ordering and multi-stage input buffering. We evaluate MemXCT on various supercomputer architectures incolving KNL and GPU. MemXCT can reconstruct a large (11K×11K) mouse brain tomogram in ∼10 seconds using 4096 KNL nodes (256K cores), the largest iterative reconstruction achieved in near-real time.
AB - X-ray computed tomography (XCT)is used regularly at synchrotron light sources to study the internal morphology of materials at high resolution. However, experimental constraints, such as radiation sensitivity, can result in noisy or undersampled measurements. Further, depending on the resolution, sample size and data acquisition rates, the resulting noisy dataset can be terabyte-scale. Advanced iterative reconstruction techniques can produce high-quality images from noisy measurements, but their computational requirements have made their use exception rather than the rule. We propose here a novel memory-centric approach that avoids redundant computations at the expense of additional memory complexity. We develop a system, MemXCT, that uses an optimized SpMV implementation with two-level pseudo-Hilbert ordering and multi-stage input buffering. We evaluate MemXCT on various supercomputer architectures incolving KNL and GPU. MemXCT can reconstruct a large (11K×11K) mouse brain tomogram in ∼10 seconds using 4096 KNL nodes (256K cores), the largest iterative reconstruction achieved in near-real time.
UR - http://www.scopus.com/inward/record.url?scp=85076139062&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85076139062&partnerID=8YFLogxK
U2 - 10.1145/3295500.3356220
DO - 10.1145/3295500.3356220
M3 - Conference contribution
AN - SCOPUS:85076139062
T3 - International Conference for High Performance Computing, Networking, Storage and Analysis, SC
BT - Proceedings of SC 2019
PB - IEEE Computer Society
Y2 - 17 November 2019 through 22 November 2019
ER -