Discriminative and Distinct Phenotyping by Constrained Tensor Factorization

Yejin Kim, Robert El-Kareh, Jimeng Sun, Hwanjo Yu, Xiaoqian Jiang

Research output: Contribution to journalArticlepeer-review

Abstract

Adoption of Electronic Health Record (EHR) systems has led to collection of massive healthcare data, which creates oppor-tunities and challenges to study them. Computational phenotyping offers a promising way to convert the sparse and complex data into meaningful concepts that are interpretable to healthcare givers to make use of them. We propose a novel su-pervised nonnegative tensor factorization methodology that derives discriminative and distinct phenotypes. We represented co-occurrence of diagnoses and prescriptions in EHRs as a third-order tensor, and decomposed it using the CP algorithm. We evaluated discriminative power of our models with an Intensive Care Unit database (MIMIC-III) and demonstrated superior performance than state-of-The-Art ICU mortality calculators (e.g., APACHE II, SAPS II). Example of the resulted phenotypes are sepsis with acute kidney injury, cardiac surgery, anemia, respiratory failure, heart failure, cardiac arrest, metastatic cancer (requiring ICU), end-stage dementia (requiring ICU and transitioned to comfort-care), intraabdominal conditions, and alcohol abuse/withdrawal.

Original languageEnglish (US)
Article number1114
JournalScientific reports
Volume7
Issue number1
DOIs
StatePublished - Dec 1 2017
Externally publishedYes

ASJC Scopus subject areas

  • General

Fingerprint

Dive into the research topics of 'Discriminative and Distinct Phenotyping by Constrained Tensor Factorization'. Together they form a unique fingerprint.

Cite this