Stela: Enabling stream processing systems to scale-in and scale-out on-demand

Le Xu, Boyang Peng, Indranil Gupta

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The era of big data has led to the emergence of new real-time distributed stream processing engines like Apache Storm. We present Stela (STream processing ELAsticity), a stream processing system that supports scale-out and scale-in operations in an on-demand manner, i.e., when the user requests such a scaling operation. Stela meets two goals: 1) it optimizes post-scaling throughput, and 2) it minimizes interruption to the ongoing computation while the scaling operation is being carried out. We have integrated Stela into Apache Storm. We present experimental results using micro-benchmark Storm applications, as well as production applications from industry (Yahoo! Inc. and IBM). Our experiments show that compared to Apache Storm's default scheduler, Stela's scale-out operation achieves throughput that is 21-120% higher, and interruption time that is significantly smaller. Stela's scale-in operation chooses the right set of servers to remove and achieves 2X-5X higher throughput than Storm's default strategy.

Original languageEnglish (US)
Title of host publicationProceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016
Subtitle of host publicationCo-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages22-31
Number of pages10
ISBN (Electronic)9781509019618
DOIs
StatePublished - Jun 1 2016
Event4th IEEE Annual International Conference on Cloud Engineering, IC2E 2016 - Berlin, Germany
Duration: Apr 4 2016Apr 8 2016

Publication series

NameProceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016

Other

Other4th IEEE Annual International Conference on Cloud Engineering, IC2E 2016
CountryGermany
CityBerlin
Period4/4/164/8/16

Fingerprint

Elasticity
Processing
Throughput
Servers
Engines
Industry
Experiments

Keywords

  • Distributed Systems
  • Elasticity
  • Scalability
  • Stream Processing

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Computer Networks and Communications

Cite this

Xu, L., Peng, B., & Gupta, I. (2016). Stela: Enabling stream processing systems to scale-in and scale-out on-demand. In Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016 (pp. 22-31). [7484160] (Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IC2E.2016.38

Stela : Enabling stream processing systems to scale-in and scale-out on-demand. / Xu, Le; Peng, Boyang; Gupta, Indranil.

Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016. Institute of Electrical and Electronics Engineers Inc., 2016. p. 22-31 7484160 (Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Xu, L, Peng, B & Gupta, I 2016, Stela: Enabling stream processing systems to scale-in and scale-out on-demand. in Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016., 7484160, Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016, Institute of Electrical and Electronics Engineers Inc., pp. 22-31, 4th IEEE Annual International Conference on Cloud Engineering, IC2E 2016, Berlin, Germany, 4/4/16. https://doi.org/10.1109/IC2E.2016.38
Xu L, Peng B, Gupta I. Stela: Enabling stream processing systems to scale-in and scale-out on-demand. In Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016. Institute of Electrical and Electronics Engineers Inc. 2016. p. 22-31. 7484160. (Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016). https://doi.org/10.1109/IC2E.2016.38
Xu, Le ; Peng, Boyang ; Gupta, Indranil. / Stela : Enabling stream processing systems to scale-in and scale-out on-demand. Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016. Institute of Electrical and Electronics Engineers Inc., 2016. pp. 22-31 (Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016).
@inproceedings{cacd4aa67ffa460491acce31c8d3cf59,
title = "Stela: Enabling stream processing systems to scale-in and scale-out on-demand",
abstract = "The era of big data has led to the emergence of new real-time distributed stream processing engines like Apache Storm. We present Stela (STream processing ELAsticity), a stream processing system that supports scale-out and scale-in operations in an on-demand manner, i.e., when the user requests such a scaling operation. Stela meets two goals: 1) it optimizes post-scaling throughput, and 2) it minimizes interruption to the ongoing computation while the scaling operation is being carried out. We have integrated Stela into Apache Storm. We present experimental results using micro-benchmark Storm applications, as well as production applications from industry (Yahoo! Inc. and IBM). Our experiments show that compared to Apache Storm's default scheduler, Stela's scale-out operation achieves throughput that is 21-120{\%} higher, and interruption time that is significantly smaller. Stela's scale-in operation chooses the right set of servers to remove and achieves 2X-5X higher throughput than Storm's default strategy.",
keywords = "Distributed Systems, Elasticity, Scalability, Stream Processing",
author = "Le Xu and Boyang Peng and Indranil Gupta",
year = "2016",
month = "6",
day = "1",
doi = "10.1109/IC2E.2016.38",
language = "English (US)",
series = "Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "22--31",
booktitle = "Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016",
address = "United States",

}

TY - GEN

T1 - Stela

T2 - Enabling stream processing systems to scale-in and scale-out on-demand

AU - Xu, Le

AU - Peng, Boyang

AU - Gupta, Indranil

PY - 2016/6/1

Y1 - 2016/6/1

N2 - The era of big data has led to the emergence of new real-time distributed stream processing engines like Apache Storm. We present Stela (STream processing ELAsticity), a stream processing system that supports scale-out and scale-in operations in an on-demand manner, i.e., when the user requests such a scaling operation. Stela meets two goals: 1) it optimizes post-scaling throughput, and 2) it minimizes interruption to the ongoing computation while the scaling operation is being carried out. We have integrated Stela into Apache Storm. We present experimental results using micro-benchmark Storm applications, as well as production applications from industry (Yahoo! Inc. and IBM). Our experiments show that compared to Apache Storm's default scheduler, Stela's scale-out operation achieves throughput that is 21-120% higher, and interruption time that is significantly smaller. Stela's scale-in operation chooses the right set of servers to remove and achieves 2X-5X higher throughput than Storm's default strategy.

AB - The era of big data has led to the emergence of new real-time distributed stream processing engines like Apache Storm. We present Stela (STream processing ELAsticity), a stream processing system that supports scale-out and scale-in operations in an on-demand manner, i.e., when the user requests such a scaling operation. Stela meets two goals: 1) it optimizes post-scaling throughput, and 2) it minimizes interruption to the ongoing computation while the scaling operation is being carried out. We have integrated Stela into Apache Storm. We present experimental results using micro-benchmark Storm applications, as well as production applications from industry (Yahoo! Inc. and IBM). Our experiments show that compared to Apache Storm's default scheduler, Stela's scale-out operation achieves throughput that is 21-120% higher, and interruption time that is significantly smaller. Stela's scale-in operation chooses the right set of servers to remove and achieves 2X-5X higher throughput than Storm's default strategy.

KW - Distributed Systems

KW - Elasticity

KW - Scalability

KW - Stream Processing

UR - http://www.scopus.com/inward/record.url?scp=84978062188&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84978062188&partnerID=8YFLogxK

U2 - 10.1109/IC2E.2016.38

DO - 10.1109/IC2E.2016.38

M3 - Conference contribution

AN - SCOPUS:84978062188

T3 - Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016: Co-located with the 1st IEEE International Conference on Internet-of-Things Design and Implementation, IoTDI 2016

SP - 22

EP - 31

BT - Proceedings - 2016 IEEE International Conference on Cloud Engineering, IC2E 2016

PB - Institute of Electrical and Electronics Engineers Inc.

ER -