Integrating distance metrics learned from multiple experts and its application in patient similarity assessment

Fei Wang, Jimeng Sun, Shahram Ebadollahi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Patient similarity assessment is an important task in the context of patient cohort identification for comparative effectiveness studies and clinical decision support applications. The goal is to derive clinically meaningful distance metric to measure the similarity between patients represented by their key clinical indicators. It is desirable to learn the distance metric based on experts' knowledge of clinical similarity among subjects. However, often different physicians have different understandings of patient similarity based on the specifics of the cases. The distance metric learned for each individual physician often leads to a limited view of the true underlying distance metric. The key challenge will be how to integrate the individual distance metrics obtained for a group of physicians into a globally consistent unified metric. In this paper, we propose the Composite Distance Integration (Comdi) approach. In this approach we first construct discriminative neighborhoods from each individual metrics, then we combine them into a single optimal distance metric. We formulate Comdi as a quadratic optimization problem and propose an efficient alternating strategy to find the optimal solution. Besides learning a globally consistent metric, Comdi provides an elegant way to share knowledge across multiple experts (physicians) without sharing the underlying data, which enables the privacy preserving collaboration. Our experiments on several benchmark data sets show approximately 10% improvement in classification accuracy over baseline. These results show that Comdi is an effective and general metric learning approach. An application of our approach to real patient data has also been presented in the results.

Original languageEnglish (US)
Title of host publicationProceedings of the 11th SIAM International Conference on Data Mining, SDM 2011
PublisherSociety for Industrial and Applied Mathematics Publications
Pages59-70
Number of pages12
ISBN (Print)9780898719925
DOIs
StatePublished - Dec 1 2011
Externally publishedYes
Event11th SIAM International Conference on Data Mining, SDM 2011 - Mesa, AZ, United States
Duration: Apr 28 2011Apr 30 2011

Publication series

NameProceedings of the 11th SIAM International Conference on Data Mining, SDM 2011

Other

Other11th SIAM International Conference on Data Mining, SDM 2011
Country/TerritoryUnited States
CityMesa, AZ
Period4/28/114/30/11

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'Integrating distance metrics learned from multiple experts and its application in patient similarity assessment'. Together they form a unique fingerprint.

Cite this