Performance optimization on model synchronization in parallel stochastic gradient descent based SVM

Vibhatha Lakmal Abeykoon, Geoffrey Fox, Minje Kim

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Understanding the bottlenecks in implementing stochastic gradient descent (SGD)-based distributed support vector machines (SVM) algorithm is important in training larger data sets. The communication time to do the model synchronization across the parallel processes is the main bottleneck that causes inefficiency in the training process. The model synchronization is directly affected by the mini-batch size of data processed before the global synchronization. In producing an efficient distributed model, the communication time in training model synchronization has to be as minimum as possible while retaining a high testing accuracy. The effect from model synchronization frequency over the convergence of the algorithm and accuracy of the generated model must be well understood to design an efficient distributed model. In this research, we identify the bottlenecks in model synchronization in parallel stochastic gradient descent (PSGD)-based SVM algorithm with respect to the training model synchronization frequency (MSF). Our research shows that by optimizing the MSF in the data sets that we used, a reduction of 98% in communication time can be gained (16x - 24x speed up) with respect to high-frequency model synchronization. The training model optimization discussed in this paper guarantees a higher accuracy than the sequential algorithm along with faster convergence.

Original languageEnglish (US)
Title of host publicationProceedings - 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages508-517
Number of pages10
ISBN (Electronic)9781728109121
DOIs
StatePublished - May 2019
Externally publishedYes
Event19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2019 - Larnaca, Cyprus
Duration: May 14 2019May 17 2019

Publication series

NameProceedings - 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2019

Conference

Conference19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2019
Country/TerritoryCyprus
CityLarnaca
Period5/14/195/17/19

Keywords

  • Distributed communication optimization
  • Model synchronization
  • Scaling SVM
  • Sgd
  • SVM

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems
  • Information Systems and Management
  • Management Science and Operations Research
  • Health Informatics

Fingerprint

Dive into the research topics of 'Performance optimization on model synchronization in parallel stochastic gradient descent based SVM'. Together they form a unique fingerprint.

Cite this