Social trove: A self-summarizing storage service for social sensing

Md Tanvir Al Amin, Shen Li, Muntasir Raihan Rahman, Panindra Tumkur Seetharamu, Shiguang Wang, Tarek Abdelzaher, Indranil Gupta, Mudhakar Srivatsa, Raghu Ganti, Reaz Ahmed, Hieu Le

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The increasing availability of smartphones, cameras, and wearables with instant data sharing capabilities, and the exploitation of social networks for information broadcast, heralds a future of real-time information overload. With the growing excess of worldwide streaming data, such as images, geotags, text annotations, and sensory measurements, an increasingly common service will become one of data summarization. The objective of such a service will be to obtain a representative sampling of large data streams at a configurable granularity, in real-time, for subsequent consumption by a range of data-centric applications. This paper describes a general-purpose self-summarizing storage service, called Social Trove, for social sensing applications. The service summarizes data streams from human sources, or sensors in their possession, by hierarchically clustering received information in accordance with an application-specific distance metric. It then serves a sampling of produced clusters at a configurable granularity in response to application queries. While Social Trove is a general service, we illustrate its functionality and evaluate it in the specific context of workloads collected from Twitter. Results show that Social Trove supports a high query throughput, while maintaining a low access latency to the produced real-time application-specific data summaries. As a specific application case-study, we implement a fact-finding service on top of Social Trove.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE International Conference on Autonomic Computing, ICAC 2015
EditorsPhilippe Lalanda, Samuel Kounev, Ada Diaconescu, Lucy Cherkasova
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages41-50
Number of pages10
ISBN (Electronic)9781467369701
DOIs
StatePublished - Sep 14 2015
Event12th IEEE International Conference on Autonomic Computing, ICAC 2015 - Grenoble, France
Duration: Jul 7 2015Jul 10 2015

Publication series

NameProceedings - IEEE International Conference on Autonomic Computing, ICAC 2015

Other

Other12th IEEE International Conference on Autonomic Computing, ICAC 2015
CountryFrance
CityGrenoble
Period7/7/157/10/15

Fingerprint

Sampling
Smartphones
Cameras
Throughput
Availability
Sensors

Keywords

  • Clustering
  • Social Sensing
  • Storage
  • Summarization

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Software
  • Control and Systems Engineering

Cite this

Al Amin, M. T., Li, S., Rahman, M. R., Seetharamu, P. T., Wang, S., Abdelzaher, T., ... Le, H. (2015). Social trove: A self-summarizing storage service for social sensing. In P. Lalanda, S. Kounev, A. Diaconescu, & L. Cherkasova (Eds.), Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015 (pp. 41-50). [7266933] (Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICAC.2015.47

Social trove : A self-summarizing storage service for social sensing. / Al Amin, Md Tanvir; Li, Shen; Rahman, Muntasir Raihan; Seetharamu, Panindra Tumkur; Wang, Shiguang; Abdelzaher, Tarek; Gupta, Indranil; Srivatsa, Mudhakar; Ganti, Raghu; Ahmed, Reaz; Le, Hieu.

Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015. ed. / Philippe Lalanda; Samuel Kounev; Ada Diaconescu; Lucy Cherkasova. Institute of Electrical and Electronics Engineers Inc., 2015. p. 41-50 7266933 (Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Al Amin, MT, Li, S, Rahman, MR, Seetharamu, PT, Wang, S, Abdelzaher, T, Gupta, I, Srivatsa, M, Ganti, R, Ahmed, R & Le, H 2015, Social trove: A self-summarizing storage service for social sensing. in P Lalanda, S Kounev, A Diaconescu & L Cherkasova (eds), Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015., 7266933, Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015, Institute of Electrical and Electronics Engineers Inc., pp. 41-50, 12th IEEE International Conference on Autonomic Computing, ICAC 2015, Grenoble, France, 7/7/15. https://doi.org/10.1109/ICAC.2015.47
Al Amin MT, Li S, Rahman MR, Seetharamu PT, Wang S, Abdelzaher T et al. Social trove: A self-summarizing storage service for social sensing. In Lalanda P, Kounev S, Diaconescu A, Cherkasova L, editors, Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015. Institute of Electrical and Electronics Engineers Inc. 2015. p. 41-50. 7266933. (Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015). https://doi.org/10.1109/ICAC.2015.47
Al Amin, Md Tanvir ; Li, Shen ; Rahman, Muntasir Raihan ; Seetharamu, Panindra Tumkur ; Wang, Shiguang ; Abdelzaher, Tarek ; Gupta, Indranil ; Srivatsa, Mudhakar ; Ganti, Raghu ; Ahmed, Reaz ; Le, Hieu. / Social trove : A self-summarizing storage service for social sensing. Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015. editor / Philippe Lalanda ; Samuel Kounev ; Ada Diaconescu ; Lucy Cherkasova. Institute of Electrical and Electronics Engineers Inc., 2015. pp. 41-50 (Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015).
@inproceedings{2985bd54c00e42f7a7175ac872fd91ef,
title = "Social trove: A self-summarizing storage service for social sensing",
abstract = "The increasing availability of smartphones, cameras, and wearables with instant data sharing capabilities, and the exploitation of social networks for information broadcast, heralds a future of real-time information overload. With the growing excess of worldwide streaming data, such as images, geotags, text annotations, and sensory measurements, an increasingly common service will become one of data summarization. The objective of such a service will be to obtain a representative sampling of large data streams at a configurable granularity, in real-time, for subsequent consumption by a range of data-centric applications. This paper describes a general-purpose self-summarizing storage service, called Social Trove, for social sensing applications. The service summarizes data streams from human sources, or sensors in their possession, by hierarchically clustering received information in accordance with an application-specific distance metric. It then serves a sampling of produced clusters at a configurable granularity in response to application queries. While Social Trove is a general service, we illustrate its functionality and evaluate it in the specific context of workloads collected from Twitter. Results show that Social Trove supports a high query throughput, while maintaining a low access latency to the produced real-time application-specific data summaries. As a specific application case-study, we implement a fact-finding service on top of Social Trove.",
keywords = "Clustering, Social Sensing, Storage, Summarization",
author = "{Al Amin}, {Md Tanvir} and Shen Li and Rahman, {Muntasir Raihan} and Seetharamu, {Panindra Tumkur} and Shiguang Wang and Tarek Abdelzaher and Indranil Gupta and Mudhakar Srivatsa and Raghu Ganti and Reaz Ahmed and Hieu Le",
year = "2015",
month = "9",
day = "14",
doi = "10.1109/ICAC.2015.47",
language = "English (US)",
series = "Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "41--50",
editor = "Philippe Lalanda and Samuel Kounev and Ada Diaconescu and Lucy Cherkasova",
booktitle = "Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015",
address = "United States",

}

TY - GEN

T1 - Social trove

T2 - A self-summarizing storage service for social sensing

AU - Al Amin, Md Tanvir

AU - Li, Shen

AU - Rahman, Muntasir Raihan

AU - Seetharamu, Panindra Tumkur

AU - Wang, Shiguang

AU - Abdelzaher, Tarek

AU - Gupta, Indranil

AU - Srivatsa, Mudhakar

AU - Ganti, Raghu

AU - Ahmed, Reaz

AU - Le, Hieu

PY - 2015/9/14

Y1 - 2015/9/14

N2 - The increasing availability of smartphones, cameras, and wearables with instant data sharing capabilities, and the exploitation of social networks for information broadcast, heralds a future of real-time information overload. With the growing excess of worldwide streaming data, such as images, geotags, text annotations, and sensory measurements, an increasingly common service will become one of data summarization. The objective of such a service will be to obtain a representative sampling of large data streams at a configurable granularity, in real-time, for subsequent consumption by a range of data-centric applications. This paper describes a general-purpose self-summarizing storage service, called Social Trove, for social sensing applications. The service summarizes data streams from human sources, or sensors in their possession, by hierarchically clustering received information in accordance with an application-specific distance metric. It then serves a sampling of produced clusters at a configurable granularity in response to application queries. While Social Trove is a general service, we illustrate its functionality and evaluate it in the specific context of workloads collected from Twitter. Results show that Social Trove supports a high query throughput, while maintaining a low access latency to the produced real-time application-specific data summaries. As a specific application case-study, we implement a fact-finding service on top of Social Trove.

AB - The increasing availability of smartphones, cameras, and wearables with instant data sharing capabilities, and the exploitation of social networks for information broadcast, heralds a future of real-time information overload. With the growing excess of worldwide streaming data, such as images, geotags, text annotations, and sensory measurements, an increasingly common service will become one of data summarization. The objective of such a service will be to obtain a representative sampling of large data streams at a configurable granularity, in real-time, for subsequent consumption by a range of data-centric applications. This paper describes a general-purpose self-summarizing storage service, called Social Trove, for social sensing applications. The service summarizes data streams from human sources, or sensors in their possession, by hierarchically clustering received information in accordance with an application-specific distance metric. It then serves a sampling of produced clusters at a configurable granularity in response to application queries. While Social Trove is a general service, we illustrate its functionality and evaluate it in the specific context of workloads collected from Twitter. Results show that Social Trove supports a high query throughput, while maintaining a low access latency to the produced real-time application-specific data summaries. As a specific application case-study, we implement a fact-finding service on top of Social Trove.

KW - Clustering

KW - Social Sensing

KW - Storage

KW - Summarization

UR - http://www.scopus.com/inward/record.url?scp=84961768118&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84961768118&partnerID=8YFLogxK

U2 - 10.1109/ICAC.2015.47

DO - 10.1109/ICAC.2015.47

M3 - Conference contribution

AN - SCOPUS:84961768118

T3 - Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015

SP - 41

EP - 50

BT - Proceedings - IEEE International Conference on Autonomic Computing, ICAC 2015

A2 - Lalanda, Philippe

A2 - Kounev, Samuel

A2 - Diaconescu, Ada

A2 - Cherkasova, Lucy

PB - Institute of Electrical and Electronics Engineers Inc.

ER -