Characterization of operational failures from a business data processing SaaS platform

Catello Di Martino, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Geetika Goel, Santonu Sarkar, Rajeshwari Ganesan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper characterizes operational failures of a production Custom Package Good Software-as-a-Service (SaaS) plat-form. Events log collected over 283 days of in-field oper-ation are used to characterize platform failures. The char-acterization is performed by estimating (i) common failure types of the platform, (ii) key factors impacting platform failures, (iii) failure rate, and (iv) how user workload (files submitted for processing) impacts on the failure rate. The major findings are: (i) 34.1% of failures are caused by un-expected values in customers' data, (ii) nearly 33% of the failures are because of timeout, and (iii) the failure rate in-creases if the workload intensity (transactions/second) in-creases, while there is no statistical evidence of being in u-enced by the workload volume (size of users' data). Finally, the paper presents the lessons learned and how the findings and the implemented analysis tool allow platform develop-ers to improve platform code, system settings and customer management.

Original languageEnglish (US)
Title of host publication36th International Conference on Software Engineering, ICSE Companion 2014 - Proceedings
PublisherAssociation for Computing Machinery
Pages195-204
Number of pages10
ISBN (Print)9781450327688
DOIs
StatePublished - 2014
Event36th International Conference on Software Engineering, ICSE 2014 - Hyderabad, India
Duration: May 31 2014Jun 7 2014

Publication series

Name36th International Conference on Software Engineering, ICSE Companion 2014 - Proceedings

Other

Other36th International Conference on Software Engineering, ICSE 2014
CountryIndia
CityHyderabad
Period5/31/146/7/14

Keywords

  • Cloud computing
  • Failure analysis
  • Logs
  • Robustness
  • SaaS

ASJC Scopus subject areas

  • Software

Fingerprint Dive into the research topics of 'Characterization of operational failures from a business data processing SaaS platform'. Together they form a unique fingerprint.

Cite this