Missing information imputation for disease-dedicated social networks with heterogeneous auxiliary data

Xu Liu, Jingrui He, Wanli Min, Hongxia Yang

Research output: Contribution to journalArticle

Abstract

Many high impact applications suffer from missing information. For example, disease-dedicated social networks provide additional resources to glimpse into patients’ daily life related to disease management. However, due to the voluntary nature of such social networks, the information reported by patients is often incomplete, making the following data analytics tasks particularly challenging. On the other hand, in addition to the target data that we aim to analyze, we may also have other related data at our disposal. For example, to analyze disease-dedicated social networks, auxiliary clinical data (with potentially non-overlapping patients), as well as the users’ online social relationship might provide additional information for estimating the missing information. Therefore, the key question we aim to answer in this paper is how we can leverage the heterogeneous auxiliary data for the sake of missing information imputation. To answer this question, we focus on diabetes-dedicated social networks, and we aim to estimate the missing information from patients’ self-reported biomarker measurements. In particular, we propose a hypergraph structure to model the relationship among users and user-generated content (posts). Based on the hypergraph structure, we further introduce an optimization framework to estimate the missing biomarker measurements using heterogeneous auxiliary data. To solve the optimization framework, we design iterative algorithms to find the local optimal solution. Experimental results on both synthetic and real data sets (including a data set collected from a diabetes-dedicated social network) demonstrate the effectiveness of the proposed algorithms.

Original languageEnglish (US)
JournalIISE Transactions on Healthcare Systems Engineering
DOIs
StateAccepted/In press - Jan 1 2020
Externally publishedYes

    Fingerprint

Keywords

  • Disease-dedicated social network
  • heterogeneous learning
  • missing value imputation

ASJC Scopus subject areas

  • Safety, Risk, Reliability and Quality
  • Safety Research
  • Public Health, Environmental and Occupational Health

Cite this