Imputing Missing Social Media Data Stream in Multisensor Studies of Human Behavior

Koustuv Saha, Raghu Mulukutla, Kari Nies, Pablo Robles-Granda, Anusha Sirigiri, Dong Whi Yoo, Pino Audia, Andrew T. Campbell, Nitesh V. Chawla, Sidney K. D'Mello, Anind K. Dey, Manikanta D. Reddy, Kaifeng Jiang, Qiang Liu, Gloria Mark, Edward Moskal, Aaron Striegel, Munmun De Choudhury, Vedant Das Swain, Julie M. GreggTed Grover, Suwen Lin, Gonzalo J. Martinez, Stephen M. Mattingly, Shayan Mirjafari

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The ubiquitous use of social media enables researchers to obtain self-recorded longitudinal data of individuals in real-time. Because this data can be collected in an inexpensive and unobtrusive way at scale, social media has been adopted as a 'passive sensor' to study human behavior. However, such research is impacted by the lack of homogeneity in the use of social media, and the engineering challenges in obtaining such data. This paper proposes a statistical framework to leverage the potential of social media in sensing studies of human behavior, while navigating the challenges associated with its sparsity. Our framework is situated in a large-scale in-situ study concerning the passive assessment of psychological constructs of 757 information workers wherein of four sensing streams was deployed-bluetooth beacons, wearable, smartphone, and social media. Our framework includes principled feature transformation and machine learning models that predict latent social media features from the other passive sensors. We demonstrate the efficacy of this imputation framework via a high correlation of 0.78 between actual and imputed social media features. With the imputed features we test and validate predictions on psychological constructs like personality traits and affect. We find that adding the social media data streams, in their imputed form, improves the prediction of these measures. We discuss how our framework can be valuable in multimodal sensing studies that aim to gather comprehensive signals about an individual's state or situation.

Original languageEnglish (US)
Title of host publication2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728138886
DOIs
StatePublished - Sep 2019
Externally publishedYes
Event8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019 - Cambridge, United Kingdom
Duration: Sep 3 2019Sep 6 2019

Publication series

Name2019 8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019

Conference

Conference8th International Conference on Affective Computing and Intelligent Interaction, ACII 2019
Country/TerritoryUnited Kingdom
CityCambridge
Period9/3/199/6/19

Keywords

  • Imputation
  • Multisensor
  • Social media
  • Wellbeing

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Human-Computer Interaction
  • Behavioral Neuroscience
  • Social Psychology

Fingerprint

Dive into the research topics of 'Imputing Missing Social Media Data Stream in Multisensor Studies of Human Behavior'. Together they form a unique fingerprint.

Cite this