Abstract
Many high impact applications suffer from missing information. For example, disease-dedicated social networks provide additional resources to glimpse into patients’ daily life related to disease management. However, due to the voluntary nature of such social networks, the information reported by patients is often incomplete, making the following data analytics tasks particularly challenging. On the other hand, in addition to the target data that we aim to analyze, we may also have other related data at our disposal. For example, to analyze disease-dedicated social networks, auxiliary clinical data (with potentially non-overlapping patients), as well as the users’ online social relationship might provide additional information for estimating the missing information. Therefore, the key question we aim to answer in this paper is how we can leverage the heterogeneous auxiliary data for the sake of missing information imputation. To answer this question, we focus on diabetes-dedicated social networks, and we aim to estimate the missing information from patients’ self-reported biomarker measurements. In particular, we propose a hypergraph structure to model the relationship among users and user-generated content (posts). Based on the hypergraph structure, we further introduce an optimization framework to estimate the missing biomarker measurements using heterogeneous auxiliary data. To solve the optimization framework, we design iterative algorithms to find the local optimal solution. Experimental results on both synthetic and real data sets (including a data set collected from a diabetes-dedicated social network) demonstrate the effectiveness of the proposed algorithms.
Original language | English (US) |
---|---|
Pages (from-to) | 87-98 |
Number of pages | 12 |
Journal | IISE Transactions on Healthcare Systems Engineering |
Volume | 10 |
Issue number | 2 |
DOIs | |
State | Published - Apr 2 2020 |
Externally published | Yes |
Keywords
- Disease-dedicated social network
- heterogeneous learning
- missing value imputation
ASJC Scopus subject areas
- Safety, Risk, Reliability and Quality
- Safety Research
- Public Health, Environmental and Occupational Health