TY - GEN
T1 - Is Function-as-a-Service a Good Fit for Latency-Critical Services?
AU - Qiu, Haoran
AU - Jha, Saurabh
AU - Banerjee, Subho S.
AU - Patke, Archit
AU - Wang, Chen
AU - Hubertus, Franke
AU - Kalbarczyk, Zbigniew T.
AU - Iyer, Ravishankar K.
N1 - Funding Information:
We thank the anonymous reviewers for their valuable comments that improved the paper. We appreciate J. Applequist and K. Saboo for their insightful comments on the early drafts of this manuscript. This work is partially supported by the National Science Foundation (NSF) under grant No. CCF 20-29049; by the IBM-ILLINOIS Center for Cognitive Computing Systems Research (C3SR), a research collaboration that is part of the IBM AI Horizon Network; and by the IBM-ILLINOIS Discovery Accelerator Institute (IIDAI). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the NSF or IBM. Saurabh Jha is supported by a 2020 IBM PhD fellowship.
Publisher Copyright:
© 2021 ACM.
PY - 2021/12/6
Y1 - 2021/12/6
N2 - Function-as-a-Service (FaaS) is becoming an increasingly popular cloud-deployment paradigm for serverless computing that frees application developers from managing the infrastructure. At the same time, it allows cloud providers to assert control in workload consolidation, i.e., co-locating multiple containers on the same server, thereby achieving higher server utilization, often at the cost of higher end-to-end function request latency. Interestingly, a key aspect of serverless latency management has not been well studied: the trade-off between application developers' latency goals and the FaaS providers' utilization goals. This paper presents a multi-faceted, measurement-driven study of latency variation in serverless platforms that elucidates this trade-off space. We obtained production measurements by executing FaaS benchmarks on IBM Cloud and a private cloud to study the impact of workload consolidation, queuing delay, and cold starts on the end-to-end function request latency. We draw several conclusions from the characterization results. For example, increasing a container's allocated memory limit from 128 MB to 256 MB reduces the tail latency by 2× but has 1.75× higher power consumption and 59% lower CPU utilization.
AB - Function-as-a-Service (FaaS) is becoming an increasingly popular cloud-deployment paradigm for serverless computing that frees application developers from managing the infrastructure. At the same time, it allows cloud providers to assert control in workload consolidation, i.e., co-locating multiple containers on the same server, thereby achieving higher server utilization, often at the cost of higher end-to-end function request latency. Interestingly, a key aspect of serverless latency management has not been well studied: the trade-off between application developers' latency goals and the FaaS providers' utilization goals. This paper presents a multi-faceted, measurement-driven study of latency variation in serverless platforms that elucidates this trade-off space. We obtained production measurements by executing FaaS benchmarks on IBM Cloud and a private cloud to study the impact of workload consolidation, queuing delay, and cold starts on the end-to-end function request latency. We draw several conclusions from the characterization results. For example, increasing a container's allocated memory limit from 128 MB to 256 MB reduces the tail latency by 2× but has 1.75× higher power consumption and 59% lower CPU utilization.
KW - Function-as-a-Service
KW - Multi-tenancy
KW - Performance Modeling
KW - Resource Management
KW - Serverless Computing
UR - http://www.scopus.com/inward/record.url?scp=85121448756&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85121448756&partnerID=8YFLogxK
U2 - 10.1145/3493651.3493666
DO - 10.1145/3493651.3493666
M3 - Conference contribution
AN - SCOPUS:85121448756
T3 - Proceedings of the 7th International Workshop on Serverless Computing, WoSC 2021
SP - 1
EP - 8
BT - Proceedings of the 7th International Workshop on Serverless Computing, WoSC 2021
PB - Association for Computing Machinery
T2 - 7th International Workshop on Serverless Computing, WoSC 2021
Y2 - 6 December 2021
ER -