Streaming Analytics with Adaptive Near-data Processing

Atul Sandur, Chan Ho Park, Stavros Volos, Gul Agha, Myeongjae Jeon

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Streaming analytics applications need to process massive volumes of data in a timely manner, in domains ranging from datacenter telemetry and geo-distributed log analytics to Internet-of-Things systems. Such applications suffer from significant network transfer costs to transport the data to a stream processor and compute costs to analyze the data in a timely manner. Pushing the computation closer to the data source by partitioning the analytics query is an effective strategy to reduce resource costs for the stream processor. However, the partitioning strategy depends on the nature of resource bottleneck and resource variability that is encountered at the compute resources near the data source. In this paper, we investigate different issues which affect query partitioning strategies. We first study new partitioning techniques within cloud datacenters which operate under constrained compute conditions varying widely across data sources and different time slots. With insights obtained from the study, we suggest several different ways to improve the performance of stream analytics applications operating in different resource environments, by making effective partitioning decisions for a variety of use cases such as geo-distributed streaming analytics.

Original languageEnglish (US)
Title of host publicationWWW 2022 - Companion Proceedings of the Web Conference 2022
PublisherAssociation for Computing Machinery
Pages563-566
Number of pages4
ISBN (Electronic)9781450391306
DOIs
StatePublished - Apr 25 2022
Event31st ACM Web Conference, WWW 2022 - Virtual, Online, France
Duration: Apr 25 2022 → …

Publication series

NameWWW 2022 - Companion Proceedings of the Web Conference 2022

Conference

Conference31st ACM Web Conference, WWW 2022
Country/TerritoryFrance
CityVirtual, Online
Period4/25/22 → …

Keywords

  • Datacenter monitoring
  • Edge computing
  • Query partitioning
  • Streaming analytics
  • Wide area network

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Software

Fingerprint

Dive into the research topics of 'Streaming Analytics with Adaptive Near-data Processing'. Together they form a unique fingerprint.

Cite this