TY - GEN
T1 - Communication-Efficient Decentralized Local SGD over Undirected Networks
AU - Qin, Tiancheng
AU - Etesami, S. Rasoul
AU - Uribe, Cesar A.
N1 - Funding Information:
This work is supported by the NSF CAREER Award under Grant No. EPCN-1944403. Online version available at [1].
Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - We consider the distributed learning problem where a network of n agents seeks to minimize a global function F. Agents have access to F through noisy gradients, and they can locally communicate with their neighbors over an undirected network. We study the Decentralized Local SGD method, where agents perform a number of local gradient steps and occasionally exchange information with their neighbors. Previous algorithmic analysis efforts have focused on the specific network topology (star topology), where a leader node aggregates all agents' information. We generalize that setting to an arbitrary undirected network by analyzing the trade-off between the number of communication rounds and the computational effort of each agent. We bound the expected optimality gap in terms of the number of iterates T, the number of workers n, and the spectral gap of the underlying network. Our main results show that by using only R = Ω(n) communication rounds, one can achieve an error that scales as O(1/nT), where the number of communication rounds is independent of T and only depends on the number of agents.
AB - We consider the distributed learning problem where a network of n agents seeks to minimize a global function F. Agents have access to F through noisy gradients, and they can locally communicate with their neighbors over an undirected network. We study the Decentralized Local SGD method, where agents perform a number of local gradient steps and occasionally exchange information with their neighbors. Previous algorithmic analysis efforts have focused on the specific network topology (star topology), where a leader node aggregates all agents' information. We generalize that setting to an arbitrary undirected network by analyzing the trade-off between the number of communication rounds and the computational effort of each agent. We bound the expected optimality gap in terms of the number of iterates T, the number of workers n, and the spectral gap of the underlying network. Our main results show that by using only R = Ω(n) communication rounds, one can achieve an error that scales as O(1/nT), where the number of communication rounds is independent of T and only depends on the number of agents.
UR - http://www.scopus.com/inward/record.url?scp=85126017070&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85126017070&partnerID=8YFLogxK
U2 - 10.1109/CDC45484.2021.9683272
DO - 10.1109/CDC45484.2021.9683272
M3 - Conference contribution
AN - SCOPUS:85126017070
T3 - Proceedings of the IEEE Conference on Decision and Control
SP - 3361
EP - 3366
BT - 60th IEEE Conference on Decision and Control, CDC 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 60th IEEE Conference on Decision and Control, CDC 2021
Y2 - 13 December 2021 through 17 December 2021
ER -