TY - GEN
T1 - Advanced parallelization strategies using hybrid MPI-CUDA DSMC method for modeling flow through porous media
AU - Jambunathan, Revathi
AU - Levin, Deborah A.
N1 - Publisher Copyright:
© 2016, American Institute of Aeronautics and Astronautics Inc, AIAA. All rights reserved.
PY - 2016
Y1 - 2016
N2 - The gridfree octree DSMC approach is parallelized with GPGPUs using CUDA. A linear space-filling Morton Z-curve is employed to represent the three-dimensional octree structure in a linear array, and the advantages of this implementation are discussed. The methodology for a multi-GPU, hybrid MPI-CUDA implementation, involving load-balanced domain decomposition and inter-GPU communications, is presented. External flow at 1 atm over a fractal-like spherical aggregate is modeled, and strong scaling studies for the test case are analyzed. The analysis showed that the CHAOS DSMC solver was 92% efficient with 8 GPUs, while the 16-GPU case was not partitioned equally among all 16 GPUs. Argon gas flow through a fibrous microstructure represented by 1.5 million triangular panels was also modeled and showed good scaling. The material permeability of the microstructure was computed to be 253 × 10^-12 m^2.
AB - The gridfree octree DSMC approach is parallelized with GPGPUs using CUDA. A linear space-filling Morton Z-curve is employed to represent the three-dimensional octree structure in a linear array, and the advantages of this implementation are discussed. The methodology for a multi-GPU, hybrid MPI-CUDA implementation, involving load-balanced domain decomposition and inter-GPU communications, is presented. External flow at 1 atm over a fractal-like spherical aggregate is modeled, and strong scaling studies for the test case are analyzed. The analysis showed that the CHAOS DSMC solver was 92% efficient with 8 GPUs, while the 16-GPU case was not partitioned equally among all 16 GPUs. Argon gas flow through a fibrous microstructure represented by 1.5 million triangular panels was also modeled and showed good scaling. The material permeability of the microstructure was computed to be 253 × 10^-12 m^2.
UR - http://www.scopus.com/inward/record.url?scp=84980351854&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84980351854&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84980351854
SN - 9781624104350
T3 - 46th AIAA Thermophysics Conference
BT - 46th AIAA Thermophysics Conference
PB - American Institute of Aeronautics and Astronautics Inc, AIAA
T2 - 46th AIAA Thermophysics Conference, 2016
Y2 - 13 June 2016 through 17 June 2016
ER -