TY - GEN
T1 - Gluon
T2 - 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI 2018
AU - Dathathri, Roshan
AU - Gill, Gurbinder
AU - Hoang, Loc
AU - Dang, Hoang Vu
AU - Brooks, Alex
AU - Dryden, Nikoli
AU - Snir, Marc
AU - Pingali, Keshav
N1 - Publisher Copyright:
© 2018 Association for Computing Machinery.
PY - 2018/6/11
Y1 - 2018/6/11
N2 - This paper introduces a new approach to building distributed-memory graph analytics systems that exploits heterogeneity in processor types (CPU and GPU), partitioning policies, and programming models. The key to this approach is Gluon, a communication-optimizing substrate. Programmers write applications in a shared-memory programming system of their choice and interface these applications with Gluon using a lightweight API. Gluon enables these programs to run on heterogeneous clusters and optimizes communication in a novel way by exploiting structural and temporal invariants of graph partitioning policies. To demonstrate Gluon's ability to support different programming models, we interfaced Gluon with the Galois and Ligra shared-memory graph analytics systems to produce distributed-memory versions of these systems named D-Galois and D-Ligra, respectively. To demonstrate Gluon's ability to support heterogeneous processors, we interfaced Gluon with IrGL, a state-of-the-art single-GPU system for graph analytics, to produce D-IrGL, the first multi-GPU distributed-memory graph analytics system. Our experiments were done on CPU clusters with up to 256 hosts and roughly 70,000 threads and on multi-GPU clusters with up to 64 GPUs. The communication optimizations in Gluon improve end-to-end application execution time by ∼2.6× on the average. D-Galois and D-IrGL scale well and are faster than Gemini, the state-of-the-art distributed CPU graph analytics system, by factors of ∼3.9× and ∼4.9×, respectively, on the average.
AB - This paper introduces a new approach to building distributed-memory graph analytics systems that exploits heterogeneity in processor types (CPU and GPU), partitioning policies, and programming models. The key to this approach is Gluon, a communication-optimizing substrate. Programmers write applications in a shared-memory programming system of their choice and interface these applications with Gluon using a lightweight API. Gluon enables these programs to run on heterogeneous clusters and optimizes communication in a novel way by exploiting structural and temporal invariants of graph partitioning policies. To demonstrate Gluon's ability to support different programming models, we interfaced Gluon with the Galois and Ligra shared-memory graph analytics systems to produce distributed-memory versions of these systems named D-Galois and D-Ligra, respectively. To demonstrate Gluon's ability to support heterogeneous processors, we interfaced Gluon with IrGL, a state-of-the-art single-GPU system for graph analytics, to produce D-IrGL, the first multi-GPU distributed-memory graph analytics system. Our experiments were done on CPU clusters with up to 256 hosts and roughly 70,000 threads and on multi-GPU clusters with up to 64 GPUs. The communication optimizations in Gluon improve end-to-end application execution time by ∼2.6× on the average. D-Galois and D-IrGL scale well and are faster than Gemini, the state-of-the-art distributed CPU graph analytics system, by factors of ∼3.9× and ∼4.9×, respectively, on the average.
KW - Big data
KW - Communication optimizations
KW - Distributed-memory graph analytics
KW - GPUs
KW - Heterogeneous architectures
UR - http://www.scopus.com/inward/record.url?scp=85049586572&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85049586572&partnerID=8YFLogxK
U2 - 10.1145/3192366.3192404
DO - 10.1145/3192366.3192404
M3 - Conference contribution
AN - SCOPUS:85049586572
T3 - Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)
SP - 752
EP - 768
BT - PLDI 2018 - Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation
A2 - Foster, Jeffrey S.
A2 - Grossman, Dan
A2 - Foster, Jeffrey S.
PB - Association for Computing Machinery
Y2 - 18 June 2018 through 22 June 2018
ER -